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In  this  paper  we  derive  a  model  of  aggregate  investment  that  builds  from  the  lumpy  microeco- 
nomic  behavior  of  firms  facing  stochastic  fixed  adjustment  costs.  Instead  of  the  standard  sharp  (5,  s) 
bands,  firms'  adjustment  policies  take  the  form  of  a  probability  of  adjustment  (adjustment  hazard) 
that  responds  smoothly  to  changes  in  firms'  capacity  gap.  The  model  has  appealing  aggregation 
properties,  and  yields  nonlinear  aggregate  time  series  processes.  The  passivity  of  normal  times  is, 
occasionally,  more  than  offset  by  the  brisk  response  to  large  accumulated  shocks.  Using  within  and 
out-of-sample  criteria,  we  find  that  the  model  performs  substantially  better  than  the  standard  linear 
models  of  investment  for  postwar  sectoral  U.S.  manufacturing  equipment  and  structures  investment 
data. 
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1     Introduction 

Minor  upgrades  and  repairs  aside,  investment  projects  at  the  plant  level  are  intermittent 
and  lumpy  rather  than  smooth.  This  is  starkly  documented  in  Doms  and  Dunne  (1993). 
They  use  the  Longitudinal  Research  Datafile  to  study  the  investment  behavior  of  12,000 
continuing  (and  large)  U.S.  manufacturing  establishments  for  the  seventeen  year  period  . 
from  1972-1988,  and  find  that:  (i)  more  then  half  of  the  establishments  exhibit  capital 
growth  close  to  50  percent  in  a  single  year,  and  (ii)  over  25  percent,  and  perhaps  as  much 
as  40  percent,  of  an  average  plant's  gross  investment  over  the  seventeen  year  period  is 
concentrated  in  a  single  year/project.^ 

Since  this  basic  feature  of  microeconomic  data  is  seldom  considered  in  empirical  in- 
vestment equations,  it  perhaps  should  come  at  no  surprise  that  success  in  estimating  and 
testing  investment  equations  is  so  rare.''  At  a  broad  level,  our  goal  in  this  paper  is  to 
develop  and  test  a  framework  to  study  the  dynamic  behavior  of  aggregate  investment,  sub- 
ject to  the  constraint  that  it  builds  up  from  microeconomic  units  generating  the  lumpy  and 
intermittent  pattern  observed  in  microeconomic  data. 

Achieving  such  a  goal  requires  three  methodological  ingredients:  (i)  A  microeconomic 
model  of  lumpy  adjustment;  (ii)  an  aggregation  procedure;  and  (iii)  an  estimation  and 
testing  method  which  is  not  only  consistent  with  (i)  and  (ii),  but  also  able  to  highlight  the 
impact  of  the  proposed  microeconomic  model  on  aggregate  dynamics. 

As  in  the  standard  {S,s)  literature,  our  microeconomic  model  generates  lumpy  behavior 
through  the  presence  of  a  fixed  cost  of  adjusting  the  firm's  capita!  stock.  Unlike  this 
literature,  the  fixed  cost  is  random  so  the  '"inaction  range"  is  no  longer  fixed  over  time  (and 
across  firms).  The  optimal  policy  still  takes  a  simple  form  under  standard  assumptions 
about  the  stochastic  process  of  exogenous  variables:  let  z  denote  the  log-difference  between 
a  firm's  actual  and  frictionless  (i.e.,  in  the  absence  of  adjustment  cost)  stock  of  capital,  and 
w  be  a  random  variable  indexing  the  adjustment  cost  faced  by  the  firm  at  some  point  in 
time,  with  distribution  G{u).  The  solution  to  the  firm's  problem  yields  a  function  ^[z) 
that  represents  the  ma.ximum  realization  of  u  for  which  a  firm  with  imbalance  z  chooses 
not  to  adjust.  For  any  smaller  u>,  firms  adjust  fully.  Removing  conditioning  on  lj,  on  the 
other  hand,  yields  an  adjustment  hazard  function  A{z)  which  describes  the  probability  thsit 


Since  plants  entry  is  excluded  from  their  sample,  these  statistics  are  likely  to  represent  lower  bounds  on 
the  degree  of  lumpiness  in  plants'  investment  patterns. 

See  Cliirinko  (1994)  for  a  survey  of  the  empirical  investment  literature. 


a  firm  with  imbalance  z  adjusts.  Since  it  varies  smoothly  with  2,  this  probabilistic  (5,  s)- 
type  rule  is  more  amenable  to  aggregation  than  the  standard  fixed-bands  (5,5)  model  and, 
more  importantly,  has  the  virtue  of  nesting  a  wide  variety  of  models.  At  the  extremes, 
when  G(y))  degenerates  into  a  spike  we  recover  the  (5,5)  model,  while  when  it  becomes  a 
distribution  with  plenty  of  mass  at  very  low  values  of  w  and  the  remaining  mass  at  very 
high  adjustment  costs,  we  approximately  recover  a  model  with  linear  aggregate  dynamics 
(the  standard  partial  adjustment  model). 

Firms'  actions  are  not  perfectly  synchronized.  On  one  hand,  at  any  point  in  time  adjust- 
ment costs  differ  across  firms.  On  the  other,  differences  in  initial  conditions,  idiosyncratic 
shocks,  and  previous  actions,  yield  a  non-degenerate  cross  sectional  density  of  capital  imbal- 
ances, f{z,i),  at  all  times.  Aggregation  proceeds  in  two  steps,  both  under  the  assumption 
of  a  large  number  of  firms:  First,  within  each  -,  the  microeconomic  adjustment  hazard 
now  represents  the  fraction  of  units  with  that  imbalance  that  choose  to  adjust  at  any  given 
moment  in  time.  Second,  to  obtain  aggregate  investment  we  integrate  these  adjustments 
across  ~,  using  as  measure  the  current  cross  sectional  density.  In  order  to  describe  the 
dynamic  path  of  aggregate  investment  wc  characterize  the  path  o{  f{z,t)  which,  under  our 
assumptions,  is  Markovian  with  a  transition  operator  that  depends  on  the  realization  of 
aggregate  shocks. 

We  make  (fairly  flexible)  distributional  assumptions  about  aggregate  shocks  and  esti- 
mate the  model  by  Maximum  Likelihood  using  (aggregate)  two-digit  U.S.  manufacturing 
investment/capital  ratios  for  the  period  1918-1992.  We  find  clear  evidence  in  favor  of  our 
generalized  {S,s)  model,  both  in  terms  of  within  sample  criteria  and  out-of-sample  predic- 
tive power.  Our  structural  interpretation  of  these  nonlinearities  indicates  that  the  fraction 
of  firms'  capital  subject  to  fixed  costs  is  large,  and  that  these  costs  are  also  large.  Although 
important  for  both,  these  features  are  more  pronounced  for  structures  than  equipment. 
When  compared  with  standard  linear  models,  the  forecasting  accuracy  of  the  model  we 
postulate  is  substantially  improved. 

One  of  the  main  mechanisms  by  which  aggregate  dynamics  generated  by  (5,5)  type 
models  differs  from  their  linear  counterpart,  is  that  the  number  of  active  firms  changes 
over  the  cycle  —  a  point  emphasized  by  Bar-Ilan  and  Blinder  (1992).  Doms  and  Dunne's 
(1993)  confirm  the  importance  of  this  mechanism  by  showing  that  the  number  of  plants 
going  through  their  primary  investment  spikes,  rather  than  the  average  size  of  these  spikes, 
tracks  closely  aggregate  manufacturing  investment  over  time.  Consistently,  and  depending 
on  the  specific  sequence  of  events  preceding  current  events,  the  nonlinear  model  we  estimate 


has  the  potential  to  generate  brisker  expansions  than  its  linear  counterparts  has.  It  is  also 
this  feature  which  largely  explains  its  enhanced  forecasting  properties. 

Beyond  the  empirical  findings  on  investment  and  its  integrative  nature,  this  paper  has 
two  specific  methodological  contributions  to  the  new  literature  on  non-convex  adjustment 
costs  and  lumpy  actions. 

On  the  microeconomic  side,  there  have  been  several  developments  on  models  of  lumpy 
and  intermittent  adjustment  (the  {S,s)  literature).^  As  we  discussed  above,  here  we  extend 
these  models  so  the  adjustment  trigger  barriers  vary  randomly  across  firms  and  for  a  firm 
over  time.  This  modification  is  a  first  step  toward  introducing  the  realistic  and  empirically 
important  feature  that  units  do  not  always  wait  for  the  same  stock  disequilibrium  to  adjust, 
and  that  adjustments  are  not  always  of  the  same  size  across  firms  and  for  the  same  firm 
over  time,  while  preserving  a  fairly  parsimonious  aggregation  setup. 

More  recently,  there  have  also  been  developments  of  empirical  models  of  aggregate 
dynamics  with  heterogeneous  microeconomic  units  adjusting  intermittently.^  Econometric 
implementation  of  these  models,  however,  has  required  observing  (or  estimating  separately 
in  an  often  debatable  first  stage)  a  measure  of  the  exogenous  component  of  the  aggregate 
driving  force.  Our  nonlinear  time  series  procedure  does  not  require  the  first  stage;  it  only 
requires  information  on  the  aggregate  investment  series  itself  and  on  the  generating  process 
of  the  driving  force  (but  not  its  realization).  Somewhat  analogously  with  the  standard 
procedure  of  estimating  convex  adjustment  costs  parameters  from  the  first  (or  higher) 
order  serial  correlation  of  investment,  we  learn  about  more  complex  lumpy  adjustment  cost 
functions  from  the  structure  of  aggregate  investment  lags  and  their  changes  over  time. 

The  next  section  presents  the  basic  model.  It  is  followed  by  section  3,  which  describes  the 
econometric  method  and  presents  our  main  empirical  results.  Conclusions  and  extensions 
are  discussed  in  section  4.  Several  technical  appendices  follow. 


See  Harrison,  Sellke  and  Taylor  (1983)  for  a  technical  discussion  of  impulse  control  problems.  For  a  good 
survey  of  the  economics  literature  —  although  with  an  emphasis  on  models  where  investment  is  infrequent 
but  not  lumpy  —  see  Dixit  and  Pindyck  (1994).  A  model  more  closely  related  to  a  special  case  of  ours  is 
Grossman  and  Laroque's  (1990)  model  of  consumer  durable  purchases. 

^Blinder  (1981),  Bar-llan  and  Blinder  (1992)  and  Lam  (1992)  look  at  data  on  inventories  (the  first 
one)  and  consumer  durables  (the  other  two)  under  the  organizing  principles  of  (5,5)  models.  Bertola  and 
Caballero  (1990)  and  Caballero  (1993)  provide  a  structural  empirical  framework  and  estimate  {S,s)  models 
for  consumer  durable  goods.  Bertola  and  Caballero  (1994)  implement  empirically  an  irreversible  investment 
model  where  microeconomic  investment  is  intermittent  but  not  lumpy.  Caballero  and  Engel  (1992a,  1993a, 
1993b)  estimate  aggregate  models  of  employment  and  price  adjustments  when  microeconomic  units  follow 
more  general  (probabilistic)  microeconomic  adjustment  rules  but,  contrary  to  the  current  paper,  they  do 
not  derive  these  rules  from  a  microeconomic  optimization  problem. 


2     The  Basic  Model 

2.1      Overview 

We  model  a  sector  composed  of  a  large  but  fixed  number  of  monopolistically  competitive 
firms.  Each  firm  faces  an  isoelastic  demand  for  its  differentiated  product,  which  is  produced 
with  a  Cobb-Douglas  constant  returns  technology  in  labor  and  capital.  Both  demand  and 
technology  are  affected  by  multiplicative  shocks  described  by  a  joint  geometric  random  walk. 
These  shocks  have  firm  specific  and  sectoral  (aggregate)  components  which  we  specify  later. 
We  work  in  discrete  time. 

The  sector  faces  infinitely  elastic  supplies  of  labor  and  capital.  We  choose  the  price  of 
the  latter  as  numeraire  and  let  the  wage  (relative  to  the  price  of  capital)  follow  a  geometric 
random  walk  process,  possibly  correlated  with  demand  and  technology  shocks.  Firms  can 
adjust  their  labor  input  at  will  but  suffer  a  loss  when  resizing  their  stock  of  capital.  Since 
our  aim  is  to  capture  firms'  infrequent  and  lumpy  investment,  we  assume  this  loss  takes  the 
form  of  a  fixed  cost;  which  can  be  interpreted  either  as  an  index  of  the  degree  of  specificity  of 
firms'  capital,  or  as  a  secondary  market  imperfection  if  machines  or  structures  are  replaced, 
or  as  a  reorganization  cost  associated  with  putting  new  capital  to  work.  In  order  to  capture 
some  of  the  time  series  and  cross  sectional  heterogeneity  in  these  fixed  costs,  we  let  the 
extent  of  the  loss  due  to  adjustment  vary  randomly  over  time  as  firms  may,  for  example, 
find  better  or  worse  matches  or  uses  for  their  old  machines,  or  may  face  reorganizations  of 
different  degrees  of  difficulty. 

As  in  standard  {S,s)  models,  tlie  resulting  microeconomic  policy  is  one  of  inaction 
interspersed  with  periods  of  large  in\'cstnicnt  or  disinvestment.  As  in  standard  search 
models,  at  each  point  in  time  the  firm  decides  whether  to  "accept"  the  currently  offered 
fixed  adjustment  cost  or  to  postpone  adjustment  and  draw  a  new  adjustment  cost  next 
period.  The  interaction  between  these  two  mechanisms  implies  that,  more  realistically 
than  in  standard  {S,s)  models,  the  size  of  adjustments  varies  both  across  firms  and  over 
time  for  the  same  firm.  During  a  given  time  period,  firms  with  identical  shortages  or 
excesses  of  capital  act  differently.  Over  time,  the  same  firm  reacts  differently  to  similar 
disequilibria  in  its  stock  of  capital. 

Intuitively,  the  largest  adjustment  cost  a  firm  is  prepared  to  tolerate  without  adjusting 
its  capital  stock  decreases  with  the  extent  of  its  capital  stock  imbalance.  If  the  distribution 
of  adjustment  costs  is  non-degenerate,  this  implies  that  the  probability  that  a  firm  adjusts 
for  a  given  disequilibrium  —  a  concept  we  describe  as  the  firm's  adjustment  hazard  — 


increases  smoothly  and  monotonically  with  the  firm's  disequilibrium  in  its  stock  of  capital.^ 
Since  we  assume  the  number  of  firms  is  large  and  adjustment  costs  are  independent 
across  firms,  the  adjustment  /iazarc?  described  above  characterizes  actual  sectoral  investment 
at  each  point  in  time.  Given  firms'  capital  imbalances  at  the  beginning  of  a  period,  the 
fraction  of  units  resizing  their  stock  of  capital  is  determined  by  the  adjustment  hazard. 
Sectoral  investment  is  the  sum  of  the  products  of  the  adjustment  hazard  and  the  size  of 
the  investment  undertaken  by  those  firms  that  decide  to  adjust.  Equivalently,  it  is  the  sum 
of  the  expected  investment  by  firms,  conditional  on  their  capital  stock  imbalances  before 
adjusting  their  capital  stock. 

Sectoral  investment  depends  critically  on  the  number  of  firms  at  each  position  in  the 
space  of  capital  imbalances,  thereby  motivating  our  focus  on  the  cross  sectional  density  of 
disequilibria.  The  dynamics  of  sectoral  investment  are  determined  by  the  evolution  of  this 
density.  The  path  of  this  density  is  driven  by  the  interaction  of  sectoral,  firm  specific,  and 
adjustment  cost  shocks  with  the  history  of  shocks  and  actions  contained  in  previous  cross 
sectional  densities  of  disequilibria. 

2.2      The  firm 

2.2.1        Net  Profits 

When  the  firm  is  not  investing,  its  flow  of  net  profits  is: 

(1)  n{K.9)  =  K''^e-{r  +  6)K, 

where  A'  is  the  firm's  stock  of  capital;  6^  is  a  geometric  random  walk  shock  to  the  profit 
function  which  combines  demand,  productivity  and  wage  shocks;  r  and  S  are  the  discount 
and  depreciation  rates;  and  /3  is  a  parameter  which  is  less  than  one,  capturing  our  assump- 
tion of  decreasing  marginal  profitability  of  capital,  either  due  to  decreasing  returns  in  the 
technology  or  the  presence  of  some  degree  of  monopoly  power. ^ 

For  mathematical  convenience,  we  have  written  the  profit  function  net  of  flow  payment 
on  capital,  {r  +  8)K.  Since  there  are  neither  borrowing  constraints  nor  bankruptcy  options. 


^This  should  be  contrasted  with  its  two  limiting  cases:  the  standard  (S,  s)  model,  where  the  probabLIity 
of  a  firm  adjusting  jumps  from  zero  to  one  at  the  trigger  points,  and  the  standard  linear  partial  adjustment 
models,  where  this  probability  is  independent  of  the  size  of  the  firm's  disequilibrium. 

For  concreteness,  let  the  production  function  be  Cobb-Douglas  and  homogeneous  of  degree  one  with 
respect  to  capital  and  labor,  with  capital  share  o  <  1.  Let  the  demand  faced  by  the  firm  be  isoelastic,  with 
price  elasticity  minus  ?;,  1  <  rj  <  oo.  It  follows  from  these  assumptions  that  ^  =  ^(t?  —  1)/(1  +  »(')  —  1))  <  1. 


the  solution  to  the  firm's  problem  is  unchanged  by  replacing  flow  payments  for  a  lump  sum 
payment  at  the  time  of  purchase. 

It  is  useful  to  replace  6  in  the  profit  function  by  a  variable  with  more  economic  content. 
We  do  this  by  defining  the  frictionless  stock  of  capital  of  the  firm,  K* ,  as  the  solution  of 
the  maximization  of  (1)  with  respect  to  capital,  so  that: 


=  iK' 


\-P) 


where  ,f  =  (r  +  6)1  (i.  Substituting  this  expression  into  (1),  and  defining  the  disequilibrium 
variable 

2  =  ln(A7A'*), 

allows  us  to  rewrite  the  profit  function  as: 

(2)  n(z,  IC)  =  7r(r)A"  =  ^  (e^"'  -  (3e')  IC . 

Figure  1  illustrates,  and  equation  (2)  implicitly  defines,  profits  per  unit  of  frictionless  cap- 
ital, ~{z).^ 

2.2.2      Adjustment  Costs 

When  in\'esting,  a  firm  not  only  commits  to  pay  for  the  capital  acquired,  but  also  incurs 
in  adjustment  costs.  Since  we  wish  to  capture  the  intermittent  and  lumpy  nature  of  firms' 
investments,  we  require  these  costs  to  exhibit  some  form  of  increasing  returns.  There 
are  many  ways  to  do  so.  One  possibility  is  to  follow  Grossman  and  Laroque  (1990),  and 
assume  the  firm  sells  its  old  stock  of  capital  at  a  discount  when  replacing  it  by  a  new  one. 
kx\  alternative,  with  similar  implications  for  our  purposes,  is  to  assume  firms  must  shut 
down  operations  for  a  fixed  period  of  time  when  replacing  capital.  In  the  latter  case,  which 
is  the  one  we  pursue,  the  firm  incurs  in  an  adjustment  cost  proportional  to  foregone  profits 
due  to  reorganization:^ 

Adjustment    Cost  =  u;{n(7v',6')  +  (r  +  6)K]  =  uK^O, 


^Parameters:  /?  =  0.4,  r  =  0.06  and  i  =  0.1. 
See  e.g.  Cooper  and  Haltiwanger  (1993)  for  a  model  where  the  main  cost  of  reorganization  is  its  oppor- 
tunity cost. 
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where  u  represents  the  fraction  of  profits  foregone  due  to  the  capital  stock  adjustment.  A 
derivation  similar  to  the  one  that  led  to  (2)  allows  us  to  rewrite  the  adjustment  cost  in 
terms  of  z  and  A'*: 

Adjustment    Cost  =  u^e^^   K* , 

where  z~  denotes  the  capital  imbalance  immediately  before  adjustment. 

Rather  than  treating  u>  as  fixed  — as  in  standard  (5,5)  type  models —  we  let  it  be  a 
random  variable  with  a  distribution  function,  G{u),  independent  across  firms  and  over  time, 
whose  realization  is  observed  at  the  beginning  of  each  period.  With  this  slight  generalization 
of  the  standard  fixed  cost  framework  wc  capture  — in  an  admittedly  stylized  form —  two 
realistic  features:  heterogeneity  in  adjustment  costs  at  any  point  in  time,  and  time  variation 
in  these  costs  for  any  given  firm.^°  More  importantly,  it  will  be  apparent  in  section  3 
that  this  extension  gives  us  an  important  degree  of  flexibility  when  estimating  aggregate 
investment  equations. 

2.2.3      Microeconomic  Adjustment 

Given  the  increasing  returns  nature  of  the  adjustment  cost  technology,  the  optimal  policy 
is  obviously  not  one  of  continuous  and  small  investments  but  rather  one  of  periods  of 
inaction  followed  by  occasional  lumpy  investment.  Therefore,  the  firm's  problem  can  be 
characterized  in  terms  of  two  regimes:  action  and  inaction.  Finding  a  solution  to  the  firm's 
problem  is  equivalent  to  characterizing  the  partition  of  (w,z)-space  into  these  two  regions 
and  specifying  firms'  actions  when  located  in  the  region  where  they  act.  In  what  follows 
we  present  the  basic  steps  involved  in  finding  this  solution.  In  appendix  A  we  discuss  the 
technical  aspects  and  intermediate  steps  of  the  solution  in  more  detail. 

Inheriting  the  stochastic  properties  of  6,  K'  follows  a  geometric  random  walk  (with 
drift): 

A7  =  A7_,e''-, 

where  /X(  is  i.i.d.  and,  throughout  most  of  the  paper,  Normal.  This  implies  that  when  there 
is  no  adjustment,  Zt  follows  a  random  walk  (with  drift  and  Normal  innovations).  Together 
with  the  i.i.d.  nature  of  w,  this  assumption  ensures  that  the  firm's  decision  on  whether  to 


A  more  realistic  formulation  would  let  adjustment  costs  exhibit  some  persistence  at  the  individual  level. 
It  would  also  allow  for  a  distribution  of  adjustment  costs  that  depends  on  aggregate  conditions.  We  do  not 
incorporate  these  features  into  our  model  because  they  complicate  substantially  both  the  microeconomic 
and  aggregation  problems. 


adjust  its  capital  stock  in  period  t,  and  if  so  by  how  much,  is  fully  determined  by  the  vector 
{zt,K' ,ijjt),  which  we  refer  to  as  the  'state  of  the  firm'.  The  value  of  a  firm  with  before- 
adjustment-disequilibrium  2,  frictionless  stock  of  capital  A'*,  and  (current)  adjustment  cost 
parameters  — which  we  denote  by  V'{z,K',uj) —  is  the  ma^mum  of  the  value  of  the  firm 
if  it  does  not  adjust,  V{z,K'),  and  the  value  if  it  does  adjust,  V{c,K')  -  w^e^'A'*;  where 
c  is  the  optimally  determined  return  point  (see  below).  In  short: 

(3)  V'{zt,K:,u;t)  =  m^x{V[zt,K;) ,  K(c,  A7)  -  ut^e^"K;}. 

The  evolution  of  the  value  of  a  firm  that  does  not  adjust  in  the  current  period  is  described 
by: 

(4)  V(z<,A7)  =  -(r,)A7  +  (l  +  r)-iEt[l^'(-,  +  ,,/^r+:,w<  +  ,)]. 

Since  the  profit  and  adjustment  costs  functions  are  homogeneous  of  degree  one  with  respect 
to  A'",  given  z,  so  are  the  value  functions  V{z,h'')  and  V'{z,  K' ,u:).  This  allows  us  to 
reduce  the  number  of  state  variables  by  restating  the  problem  in  terms  of  the  value  per  unit 
of  frictionless  capital.  Let  v{z)  =  V{z,K')/K'  and  v'{z,u))  =  V'{z,  K',oj)/ K' .  Dividing 
both  sides  of  equations  (3)  and  (4)  by  A''  and  noting  that 


A7 


1  -(5)e-^-''+\ 


vields 


(5)  v'{zt,u}t)     =     mcix{v{zt),  v{c) -ut^e""}, 

(6)  v{zt)     =     -(-<)+v''E,  [t''(r,+,,u;<  +  ,)e"^'"'"' 

with  c  =  (1  —  (5)/(l  +  r).  Figure  2  depicts  in  an  example  the  basic  setup  developed  up  to 
now.^'  This  figure  shows  how  v{z),  v{c)  —  u>(^e^' ,  and  v'{z,uj)  determine  the  trigger  points, 
given  a  particular  realization  of  the  adjustment  cost.  The  solid  line  illustrates  the  value  of 
a  firm  that  does  not  adjust  in  the  current  period.  The  dashed  line  represents  the  value  of 
a  firm  that  decides  to  adjust,  given  a  realization  of  uj.  The  maximum  between  both  lines 
describes  v'{z,u}),  and  the  inaction  range  —  for  a  given  u  —  corresponds  to  the  interval 


''Parameters:  /?  =  0.4,  r  =  0.06,  6  =  0.1;  the  mean  and  standard  deviation  of  the  logarithm  of  A',*  are  0 
and  0.1;  the  distribution  of  adjustment  costs  is  Gamma  with  mean  0.17  and  coefficient  of  variation  0.16.  All 
the  numbers  are  broadly  consistent  with  our  estimates  and  assumptions  in  the  empirical  part  of  the  paper. 
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between  the  intersection  of  the  two  lines. 

It  follows  directly  from  maximization  of  the  value  of  a  firm  that  decides  to  adjust, 
v{c)  -  uj^e^^ ,  with  respect  to  the  return  point  c,  that  the  maximum  of  v{2)  and  v'{z,u)  is 
obtained  at  2  =  c  and  that  this  return  point  is  independent  of  the  initial  disequilibrium. 

Proposition  Al  in  Appendix  A  shows  that  the  Bellman  equation  obtained  by  substitut- 
ing v{zt)  in  (5)  by  (6)  has  a  unique  solution,  which  is  continuous  and  bounded.  Even  though 
the  functions  v{z)  we  obtained  via  value  iteration  when  performing  estimation  always  had 
a  unique  maximum,  we  have  been  unable  to  show  this  generally.  It  follows  that,  strictly 
speaking,  the  return  point  c  should  be  interpreted  as  one  of  the  points  where  v{2)  attains 
its  maximum,  say  the  smallest  value. '^ 

The  solution  also  can  be  characterized  by  the  policy  function,  fl{z),  defined  as  the 
largest  adjustment  cost  factor  for  which  the  firm  finds  it  advantageous  to  adjust  given  a 
capital  imbalance  z.  From  the  value  matching  condition  that  equates  the  two  terms  on  the 
right  hand  side  of  equation  (5),  it  follows  that: 

(7)  ,.   -'  n{z)  =  r'e-''{v{c)-v{z)), 

which  implies: 

fi(c)  =  0. 

Differentiating  (7)  with  respect  to  _,  evaluating  the  result  at  2  =  c  and  using  the  first  order 
condition  u'(c)  =  0,  yields  the  additional  ''smooth  pasting"  condition: 

n'(c)  =  0. 

Figure  3a  illustrates  the  function  ^{z)  for  the  example  in  Figure  2,  where  the  distribution 
of  adjustment  costs  is  a  Gamma.  As  follows  from  equation  (7),  if  the  firm's  disequilibrium 
is  close  enough  to  2  =  c,  it  only  will  adjust  for  arbitrarily  small  adjustment  costs.  From 
then  on,  ^(2)  increases  with  \z  —  c\. 

Figure  3b  depicts  the  inverse  of  the  function  Q(z).  We  label  L(ii!)  and  C/(w)  the  segments 
of  the  curve  below  and  above  c,  respectively.  These  functions  correspond  to  the  maximum 
shortage  and  excess  of  capital  tolerated  by  the  firm  for  any  given  realization  of  the  adjust- 
ment cost  factor  w.    That  is,  for  any  fixed  u;,  they  describe  a  standard  {L,c,U)  policy.^^ 

In  Proposition  A5  we  show  that  the  set  of  maxima,  and  hence  of  possible  return  points,  is  finite. 
'  An  {L,c,U)  policy  corresponds  to  a  two-sided  {S,s)  model.  The  notation  L,  U  and  c  stands  for  lower 


Figure  3a 


Figure  3b 


0.24 


The  area  enclosed  by  the  two  curves  corresponds  to  the  combinations  of  disequilibria  and 
adjustment  cost  factors  for  which  the  firm  chooses  not  to  adjust. 

Proposition  A2  in  Appendix  A  derives  formally  the  existence  of  0.{z),  and  Proposition 
A3  shows  that  it  is  an  analytic  function,  and  therefore  has  derivatives  of  all  order.  Propo- 
sition A4  shows  that  i^{z)  tends  to  infinity  as  \z\  tends  to  infinity.  Yet  we  have  not  been 
able  to  show  that  rt(z)  is  unimodal,  and  therefore  have  no  formal  proof  that,  conditional 
on  uj,  the  optimal  policy  is  of  the  {L,c,U)  type.  It  should  be  noted,  though,  that  all  the 
policies  obtained  numerically  when  estimating  the  parameters  in  section  3  were  unimodal. 

The  shape  and  location  of  the  function  Q.{z)  and  its  inverse,  {L{oj),c,U{u!)),  depend 
on  the  entire  distribution  of  adjustment  cost  factors,  G'(cj).  A  given  realization  of  the 
adjustment  cost  factor  will  not  generate  the  same  inaction  range  for  different  distribution 
functions  G{lj).  In  particular,  a  low  value  of  u)  is  more  likely  to  lead  to  action  when  it 
comes  from  a  distribution  of  adjustment  cost  factors  with  a  high  rather  than  a  low  average 
value. 

2.2.4      Adjustment  hazard,  expected  investment  and  ergodic  density 

Adjustment  hazard  Above  we  showed  that  for  any  given  u  the  firm  follows  a  simple 
deterministic  policy  with  respect  to  z:  actions  are  taken  only  when  z  lies  outside  the 
{L{iL!).U{u))  interval,  in  which  case  investment  occurs  so  as  to  bring  z  back  to  c.  With 
aggregation  in  mind,  here  we  reduce  the  amount  of  information  contained  in  the  policy. 
Rather  than  conditioning  on  w,  we  only  use  information  on  its  distribution  and  ask  the 
question:  what  is  the  probability  that  a  firm  with  disequilibrium  z  adjusts? 

The  answer  to  this  question  is  contained  in  what  we  call  the  adjustment  hazard.  Let 
X  =  z  —  c  denote  a  firm's  imbalance  with  respect  to  its  target  point.  A  firm  with  deviation 
X  adjusts  only  if  current  adjustment  cost  is  small  enough  to  make  adjusting  profitable  (i.e. 
if  u;  <  n(x  +  c)).  Which  means  that  the  probability  of  a  firm  adjusting,  conditional  on  its 
disequilibrium  being  equal  to  x  (the  adjustment  hazard),  is  given  by: 

(8)  A(2)  =  G(n(x  +  c)), 

where  G{u))  denotes  the  cumulative  distribution  function  for  the  adjustment  cost  factor 
uj.    For  example,  if  G{u))  is  a  Gamma  distribution  with  mean  pcj)  and  variance  p<^^,  the 


bound,  upper  bound,  and  "center,"  respectively.  See  Harrison  et  al.  (1983) 
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adjustment  hazard  is: 


^W  =  ^/o'   ■"■"'"'""'*''"■ 


<i.'r(p)  Jo 

Figure  4a  shows  the  adjustment  hazard  function  for  three  different  gamma  distributions  of 
adjustment  cost  factors.  These  distributions  differ  in  their  mean  and  variances:  the  solid  line 
corresponds  to  an  exponential  distribution  (mean  and  standard  deviation  of  0.1);  the  long 
dashes  correspond  to  a  high  variance  and  mean  distribution  (mean:  80,  standard  deviation: 
282);  while  the  short  dashes  describe  a  low  variance  and  mean  distribution  (mean:  0.14; 
standard  deviation:  0.044).  These  examples  illustrate  well  the  range  of  cases  covered  by 
our  setup.  Figure  4a  shows  that  when  the  variance  of  adjustment  costs  is  low,  there  is  a 
range  of  adjustment  costs  where  the  firm  (almost)  never  adjusts  since  adjustment  costs  are 
(almost)  never  small  enough  to  justify  it;  the  standard  {S,s)  —  or  {L,c,U)  —  case  is  an 
extreme  version  of  this.  Conversely,  when  the  variance  of  adjustment  costs  is  high,  and  so 
is  their  mean,  the  decision  of  adjustment  is  largely  motivated  by  the  adjustment  cost  draw 
rather  than  by  the  firm's  disequilibrium;  in  the  limit,  adjustment  costs  are  independent  of 
the  firm's  disequilibrium,  yielding  the  standard-linear  partial  adjustment  model. 

In  Proposition  B2  in  Appendix  B  we  show  that  A(x)  is  differentiable,  with  A(0)  =  0 
and  lim|^|_^  A(a:)  =  1. 

Expected  investment  A  firm  with  disequilibrium  x  has  a  probability  A(a:)  of  adjusting  its 
stock  of  capital  and,  if  it  does  so,  it  invests: 

(e<^  -  e^)A7  =  (e--  -  Ije-'AY  =  {e'-  -  l)A',(x). 

Thus  the  average  investment  of  firms  with  disequilibrium  x  immediately  before  adjusting 
their  capital  stock  in  period  t  is: 

(9)  E^[/((x)|i]  =  A(x)(e-^  -  l)A'((i). 

Figure  4b  depicts  expected  investment  corresponding  to  the  hazards  in  figure  4a  (with 
A't(x)  =1).  The  nonlinear-convex  nature  of  expected  investment  is  an  important  feature 
of  the  model,  playing  a  key  role  in  shaping  aggregate  investment  dynamics.  It  says  that 
incentives  to  invest  rise  more  than  proportionally  with  a  firm's  disequilibrium. '"^ 


Which  is  a  feature  not  shared  by  the  standard  quadratic  adjustment  cost  model  but  certainly  not 
exclusive  of  models  with  fixed,  or  even  non-convex,  adjustment  costs. 
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Figure  4a 
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Figure  4b 
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Ergodic  density  We  conclude  our  characterization  of  microeconomic  behavior  by  stating 
that  the  disequilibria  a  firm  goes  through  over  its  lifetime,  have  an  invariant  density.  The 
formal  proof,  which  also  shows  that  convergence  takes  place  at  an  exponential  rate,  is  given 
in  Appendix  B.  The  nonlinear  nature  of  microeconomic  adjustment,  with  relatively  small 
adjustment  for  small  imbalances,  imprints  the  opposite  pattern  on  the  ergodic  density: 
steeper  hazards  translate  into  relatively  less  mass  in  the  tails  of  the  corresponding  invariant 
density  in  exchange  for  more  mass  in  the  regions  of  low  values  of  the  adjustment  hazard 
(i.e.,  relatively  platokurtic).^^ 

2.3      Sectoral  investment 

2.3.1      Sectoral  investment  and  the  cross  sectional  density 

Let  Ki",  if^,  ^"^'ii^)  a-nd  /((x)  denote  the  aggregate  (sectoral)  stock  of  capital  and  gross 
investment,  and  the  stock  of  capital  and  gross  investment  held  by  firms  with  disequilibrium 
X  at  time  t  (before  adjustment). 

Since  adjustment  costs  shocks  are  i.i.d.  across  firms,  it  follows  directly  from  (9)  that: 


(10)  It{x)  =  {€-''  -l)\{x)Kt{x), 


where  A'((x)  denotes  the  average  stock  of  capital  of  firms  with  imbalance  x. 

Letting  /(i,  t)  denote  the  cross  sectional  density  of  disequilibria  just  before  adjustments 
take  place,  we  can  obtain  an  expression  for  aggregate  investment: 


If  =/(e""-  \)i\(x)h\{x)fix,t)dx. 

Dividing  through  by  Kf  and  rearranging  terms,  we  obtain  an  expression  for  the  aggre- 
gate investment/capital  ratio: 

(11)        |4  =  /(^"'  -  m{^)f{x,t)dx  +  |(e--  -  l)A(x)  (^^  -  ij  f{x,t)dx. 


The  second  term  on  the  right  hand  side  of  (11)  drops  out  if  (A'((a:)  -  A'/*)  and  (e   ^  -  l)A(x) 
are  uncorrelated.  Since  such  an  assumption  simplifies  computations  substantially,  we  make 


'^The  U.S.  manufacturing  plant  level  data  studied  in  Caballero,  Engel  and  Haltiwanger  (1995)  revealed 
that  the  average  observed  distribution  of  disequilibria  is  considerably  more  platokurtic  than  the  average 
distribution  of  shocks  affecting  these  plants. 
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it  and  obtain  an  approximate  expression  for  the  aggregate  investment/capital  ratio: 


(12) 


-^-  J{e-^-l)A{x)fix,t)dx. 


In  appendix  C  we  describe  in  detail  the  additional  computational  burden  of  using  (11) 
instead  of  (12),  and  present  the  results  of  Monte  Carlo  simulations  showing  that  the  cost 
of  the  approximation  is  only  minor. '^ 

It  is  apparent  from  (12)  that  since  (e~^  -  l)A(x)  is  generally  nonlinear  in  x,  aggregate 
investment  depends  not  only  on  the  first  but  also  on  higher  moments  of  the  cross  sectional 
distribution  of  disequilibria. 

2.3.2      The  linear/partial  adjustment  extreme 

An  exception   to  the  statement  in  the  previous  paragraph  occurs  when  the  adjustment 
hazard  does  not  depend  on  x  and  f  has  most  of  its  mass  near  x  =  0,  so  that  e"^  —  1  ~  —x. 
In  that  case 
(13)  -|^~-AoA',, 

where  Aq  denotes  the  constant  hazard,  and  A'(  denotes  the  average  disequilibrium  before 
adjustment.  Equation  (13)  corresponds  to  the  well  known  partial  adjustment  model  (PAM), 
and  also  coincides  with  the  standard  linear  equation  arising  from  the  quadratic  adjustment 
costs  model.  This  is  the  only  adjustment  hazard  that  does  not  require  cross  sectional 
information  on  the  right  hand  side  of  the  aggregate  investment  equation.  Indeed,  a  few 
steps  of  algebra  allow  us  to  go  from  equation  (13)  to  the  standard  expression:'' 

tA  tA 

•'(      _   A     /J    ,    ..  ^    ,    /I         A    ^  ''(-1 


(14)  -j^  =  Ao(<5  +  t'()  +  (l-Ao)    ^^    , 

where  Vi  represents  an  aggregate  shock,  to  be  defined  in  the  next  paragraph. 


The  results  in  CabaJlero,  Engel  and  Ha.ltiwanger  (1995)  are  reassuring  on  this  respect.  There  we  used 
comprehensive  establishment  level  data  for  U.S.  manufacturing  during  the  70s  and  80s,  and  documented  a 
very  close  empirical  fit  between  the  actual  and  approximate  series  during  that  period. 

Let  kt  and  k'  denote  the  average  of  the  logarithm  of  the  pre-adjustment  stock  of  capital  and  the 
frictionless  stock  of  capital,  respectively.  We  define  u,  =  A/t*,  and  note  that  Afci  ~  (/,'1]/A','l])  -  8. 
Combining  these  two  expressions  with  the  fact  that, 

A',  =  k,  -  k:  =  Ak,  -  Ak:  +  .V,-i , 
yields  (14). 
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2.3.3      Sectoral  equilibrium  and  cross  sectional  dynamics 

Shocks  to  wages,  demand,  and  productivity  drive  the  dynamics  of  frictionless  capital.  We 
decompose  these  shocks  into  sectoral  shocks,  vt,  and  firm  specific  (idiosyncratic)  shocks,  cj: 


K'  =  A7_ie 


Vl+U 


which  implies  that  when  the  firm  does  not  adjust,  the  disequilibrium  measure  x  evolves 
according  to: 

Axt  =  ~{6  +  Vt)  -  €t, 

where  capital  depreciates  at  a  rate  S  from  one  period  to  the  next.  We  assume  these  shocks 
are  exogenous  to  the  firm  and  the  sector. 

Between  two  consecutive  periods,  the  cross  section  distribution  of  disequilibria  changes 
as  a  result  of  firms'  adjustments,  depreciation,  sectoral  and  idiosyncratic  shocks.  Since  we 
are  working  in  discrete  time,  it  is  important  to  describe  the  timing  convention  we  adopt  for 
events  within  each  period.  We  denote  the  cross  section  density  at  the  end  of  period  t  —  I  hy 
f{x,  /  —  1).  Depreciation  and  the  aggregate  shock  corresponding  to  period  t  follow,  resulting 
in  the  density  f{x,t).  Next  come  adjustments,  as  determined  by  the  hazard  function  A(2;). 
Period  t  concludes  with  the  idiosyncratic  shocks.  The  final  density  is  f{x,t),  and  the  cycle 
starts  again.  Recalling  that  a  positive  shock  leads  to  a  decrease  in  x,  we  can  summarize 
this  chain  of  events  as  follows: 


(15)  f(x,i) 

(16)  f{x,t) 


f{x  +  6  +  vt,t-  1), 


j  A{y)f{y,t)dy 


g,{-x)  +  /  [1  -  A(a;  +  e)]/>  +  €,t)g,{-e)  de, 


where  g^{c)  is  the  probability  density  for  the  idiosyncratic  shocks.  The  integro-difference 
equation  describing  the  evolution  of  the  cross  sectional  distribution  from  one  period  to  the 
next  follows  directly  from  equations  (15)  and  (16): 


(17) 


fix,t) 


J Hy)fiy  +  s  +  vt,t-i)dy  g,(-x) 

+  y [1  -  A{x  +  €)]f{x  +  €  +  6  +  vt,t-  l)g,{-e)  de. 
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From  equations  (12)  and  (15)  we  obtain  the  following  aggregate  investment  equation: 

(18)  -^  =  J{e-^  -  l)Aix)fix  +  6  +  v„t  -  l)dx. 

Combining  equations  (18)  and  (17)  we  can  determine  the  sequence  of  aggregate  investment 
determined  by  an  initial  cross  section  distribution,  /(2:,0),  and  a  sequence  of  aggregate 
shocks,  {vt].  For  details  see  Appendix  C.  We  turn  to  estimation  issues  next. 

3     Empirical  Evidence:  U.S.  Investment 

3.1  Data 

Our  data  are  constructed  from  annual  gross  investment  and  capital  series  for  21  two-digit 
manufacturing  industries  from  1947  to  1992.'^  All  series  are  in  1987  dollars,  and  the  stocks 
of  capital  correspond  to  the  series  used  by  the  Bureau  of  Labor  Statistics  for  their  produc- 
tivity studies.'^  Since  capital  stocks  are  end-of-year,  our  measures  of  the  investment/capital 
ratio  used  in  estimation  start  in  1948.  We  report  separate  results  for  equipment  and  struc- 
tures panels;  each  has  945  observations. 

3.2  Econometrics 

The  econometric  problem  consists  of  estimating  the  parameters  that  characterize  (a)  firms' 
profit  functions,  (b)  the  initial  distribution  of  disequilibria,  (c)  the  distribution  of  adjust- 
ment costs,  (d)  the  distribution  of  idiosyncratic  shocks,  and  (e)  the  process  generating 
aggregate  shocks.  In  this  subsection  we  outline  the  main  features  of  the  estimation  proce- 
dure; a  detailed  description  is  presented  in  Appendix  C. 

For  tractability,  we  limit  the  number  of  parameters  being  estimated  to  those  character- 
izing the  distribution  of  adjustment  costs  or,  equivalently,  the  hazard  function.  As  far  as  the 
remaining  parameters,  we  either  fix  them  (profit  function  and  distribution  of  idiosyncratic 
shocks),  show  that  their  role  is  limited  within  a  reasonable  range  (initial  distribution  of  dis- 
equilibria) or  concentrate  them  out  of  the  likelihood  function  (process  generating  sectoral 
shocks:  individual  effects  and  cross  sectoral  variance-covariance  matrix).  Since  identifying 
nonlinearities  from  a  purely  time  series  (as  opposed  to  regressions)  dimension  requires  a 

We  have  21  rather  than  20  sectors  because  Motor  Vehicles  is  separated  from  Transportation  equipment. 
This  is  one  of  the  three  capital  stock  series  reported  by  the  Bureau  of  Econoniic  Activity. 
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large  number  of  observations,  we  impose  a  hazard  function  or  a  distribution  of  adjustment 
costs  that  is  common  across  sectors  (depending  on  whether  we  estimate  a  semi-structural 
or  structural  model  —  see  below). 

The  source  of  randomness  in  our  estimation  procedure  are  the  sectoral  shocks,  which  we 
assume  multivariate  Normal  and  independent  over  time,  for  most  of  our  empirical  analysis. 

We  approximate  the  initial  sectoral  cross-sections  by  the  invariant  cross-section  of  an 
individual  plant,  and  proceed  to  use  the  Markovian  nature  of  the  process  generating  cross- 
sectional  distributions  to  generate  these  distributions.  For  each  sector,  and  at  each  date,  the 
cross  sectional  distribution  is  updated  as  a  function  of  the  sectoral  shock,  using  an  implicit 
law  of  large  numbers  at  the  microeconomic  (firm)  level.  The  observed  sectoral  investment 
rate  is  a  non-linear  function  of  the  current  shock  and  the  distribution  prior  to  this  shock 
—  this  function  is  one-to-one  in  the  shock.  Conditional  on  the  initial  distribution,  the 
sequence  of  sectoral  shocks  and  the  cross  section  distributions  can  be  recovered  from  the 
time  series  of  sectoral  investment  rates.  The  likelihood  is  calculated  using  these  sectoral 
shocks.  We  also  need  to  calculate  the  corresponding  Jacobian  terms,  which  correspond  to 
the  elasticities  of  sectoral  investment  rates  with  respect  to  sectoral  shocks.  These  elasticities 
are  a  byproduct  of  the  calculation  of  sectoral  shocks. 

3.3      Semi-structural  and  structural  models 

We  estimate  two  basic  models.  In  the  first  one  (semi-structural),  we  estimate  directly 
the  parameters  of  an  ad-hoc  adjustment  hazard.  While  in  the  second  one  (structural), 
we  estimate  the  adjustment  costs  parameters  and  obtain  the  implied  liazard  via  dynamic 
programming.  Both  of  these  models  yield  increasing  hazards, ^°  but  include  a  constant 
hazard  as  a  limit  case. 

Semi-structural.  Although  our  main  goal  in  the  paper  is  to  estimate  and  assess  the 

structural  model,  there  are  good  reasons  to  start  by  estimating  a  less  structured  version. 
It  allows  us  to  search  and  test  for  the  presence  of  an  increasing  hazard  more  directly,  and 
it  facilitates  comparisons  with  standard  linear  models.  Furthermore,  by  assuming  that  the 
adjustment  hazard  is  an  inverted  Normal: 

(19)  A(x)  =  l-e-^°-^^^'. 

By  increasing  hazard  we  mean  a  hazard  that  is  increases  with  |i|,  i.e.,  that  is  decreasing  for  i  <  0  and 
increasing  for  i  >  0. 
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where  Aq  >  0  and  A2  >  0,  we  are  able  to  obtain  accurate  computations  and  to  reduce 
estimation  time  significantly  (by  a  factor  of  12)  by  keeping  the  cross-section  distributions 
within  a  closed  family  of  mixture  of  Normals  (see  Appendix  C). 

Structural.  Rather  than  estimating  the  adjustment  hazard  directly,  in  this  case  we 

estimate  the  parameters  of  the  adjustment  costs  function  and  obtain  the  hazard  from  the 
solution  of  the  dynamic  optimization  problem  presented  in  section  2.  Adjustment  costs  are 
drawn  from  a  Gamma  distribution: 

which  has  mean  fi^  =  p(f)  and  a  coefficient  of  variation  cv^  =  1/^.  We  estimate  /i^^  and 
cv^. 

As  for  the  other  structural  parameters,  we  assume  an  interest  rate,  share  of  each  type 
of  capital,  and  markup  of  6,  15  and  20  percent,  respectively;^'  as  well  as  depreciation  rates 
for  equipment  and  structures  of  10  and  5  percent  per  year,  respectively.'^^  We  estimated 
the  standard  deviation  of  idiosyncratic  shocks,  c^,  obtaining  estimates  in  the  range  of  5 
to  15  percent.  However,  since  these  were  not  estimated  very  precisely,  and  comparisons 
across  models  are  easier  if  idiosyncratic  variances  are  similar,  we  only  report  results  where 
wc  liave  imposed  a^  =  0.1  (both,  in  semi-structural  as  well  as  structural  estimation). 

3.4      Main  results 

Table  1  contains  our  main  results.  The  first  two  columns  present  semistructural  and  struc- 
tural results  for  equipment  investment,  while  the  last  two  do  the  same  for  investment  in 
structures.  All  estimated  models  allow  for  individual  effects  on  the  sectoral  shocks,  and 
include  a  free  additive  constant  that  is  common  across  sectors. 

Semistructural  The  semistructural  results  allow  us  to  reject  the  constant  hazard  model, 
in  favor  of  an  increasing  hazard  one.    The  increasing  hazard  parameter,  A2,  is  significant 


These  parameters  imply  a  value  of  /?  around  0.45  if  the  production  function  is  constant  returns  and  all 
other  factors  of  production  (including  the  other  type  of  capital)  are  fully  flexible,  around  0.3  if  all  factors 
but  the  other  form  of  capital  are  flexible,  and  around  0.15  if  all  other  factors  are  fixed.  Our  conclusions 
are  robust  to  reasonable  variations  of  /?,  but  we  do  not  have  enough  power  to  identify  this  parameter  in 
conjunction  with  those  that  we  estimate.  The  results  we  report  assume  jS  =  0.4. 

Although  there  is  a  slight  upward  trend  in  the  sample,  these  depreciation  rates  are  consistent  with  the 
average  depreciation  rate  computed  from  the  ratio  of  actual  depreciation  to  net  capital  stocks  reported  in 
Fixed  Reproducible  Tangible  Wealth  in  The  United  states,  1925-89. 
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Table  1:  Main  Results 


Parameters 

Equipment 

Structures 

Semi-structural      Structural 

Semi-structural 

Structural 

Ao 

0.155 

0.000 

(0.067) 

(0.054) 

A2 

2.804 

2.437 

(1.165) 

0.878) 

constant 

0.057                 -0.006 

0.013 

0.019 

(0.013)                (0.016) 

(0.006) 

(0.002) 

Mcv 

0.166 

0.228 

(0.029) 

(0.046) 

cv^ 

0.327 

0.066 

(0.109) 

(0.009) 

LOG-LIK. 

2430.2                2431.4 

2612.4 

2637.2 

Standard  deviations  in  parenthesis. 


at  the  one  percent  level  in  both  cases.  The  estimated  hazard  function  suggests  that  the 
probability  that  a  firm  adjusts  its  capital  stock  of  equipment  increases  from  about  14%  for 
small  imbalances  to  45%  when  its  imbalance  is  40%,  while  it  goes  from  close  to  zero  for 
small  imbalances  to  about  32%  for  a  40%  imbalance,  in  the  case  of  structures.  The  sharp 
nonlinearity  can  also  be  captured  through  the  expected  invested/capital  ratio  (conditional 
on  the  imbalance);  for  equipment,  it  goes  from  close  to  0.05  at  a  20  percent  imbalance  to 
0.23  at  a  40  percent  imbalance,  while  for  structures  it  goes  from  0.02  to  0.16,  for  the  same 
imbalances. 

Structural.  The  results  of  the  structural  models  confirm  the  semistructural  increasing 
hazard  findings.  Moreover,  the  likelihoods  rise,  especially  so  for  structures. 

The  estimates  of  the  mean  of  the  distribution  of  adjustment  costs  (the  fi^^s)  indicate  that 
the  average  adjustment  cost  drawn  is  the  equivalent  of  16.7  percent  of  a  year's  operational 
profits  for  equipment  and  22.8  percent  for  structures.    Since  firms  can  "search"  for  a  low 
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realization  of  adjustment  costs,  these  are  upper  bounds  for  the  average  costs  effectively  paid 
by  firms  when  going  through  a  major  adjustment  episode.  Indeed,  the  average  costs  paid 
are  11.1  percent  for  equipment  and  21.4  percent  for  structures.  The  difference  between  the 
unconditional  and  conditional  (on  adjustment)  means  rises  with  the  coefficient  of  variation 
of  these  costs,  which  explains  why  adjustment  costs  actually  paid  for  equipment  are  one 
third  less  than  the  mean  adjustment  cost  faced  by  firms  (see  cv^^)  while  in  the  case  of 
structures  both  means  are  very  similar. 

3.5      Simple  alternatives 

We  view  the  microeconomic  foundation  of  our  approach  as  one  of  its  main  virtues,  however 
in  this  section  we  look  purely  at  the  statistical  advantage  of  our  structural  models  over 
simple  linear  counterparts.  We  do  not  intend  to  conduct  "horse  races"  against  alternative 
investment  models  but  want  to  provide  a  simple  metric  to  assess  the  contribution  of  the 
nonlinearities  we  estimated  to  the  time  series  properties  of  aggregate  investment. 

The  first  obvious  step  is  to  compare  our  mode!  with  the  "almost"  nested  PAM.  As 
we  argued  above,  the  constant  hazard  model  (A2  =  0),  together  with  the  approximation 
e~^  -  1  ~  -X,  yields  the  standard  PAM,  which  corresponds  to  estimating  an  AR(1)  for 
each  sector's  aggregate  investment  series.  For  comparability  with  the  structural  model,  we 
constrain  the  correlation  coefficient  to  be  the  same  across  sectors:^'^ 

tA  tA 

■'if  ,      -'it-i     , 

•'Sf  •'Mi-l 

More  generally,  we  also  run  an  AR(2)  with  unconstrained  autoregressive  coefficients  for 
each  sector: 

't  „      I    „        "'-1      I  ^tt-2      , 

Tm  =  °'  +  P^'-Fa-  +  P^'T'A-  +  ^■'• 

■'^it  ■'Sl-1  -'^.(-2 

In  both  cases  we  preserve  the  assumption  of  jointly  Normal  aggregate  shocks,  and  allow 
for  individual  effects.  We  look  at  within  and  out-of-sample  criteria,  and  find  widespread 
evidence  supporting  the  structural  nonlinear  model  over  the  linear  representations  for  both 
equipment  and  structures. 

Within  sample  criteria. 

The  likelihoods  of  the  linear  models  are  uniformly  lower  than  those  of  the  corresponding 


Recall  that  the  structural  has  the  same  distribution  of  adjustment  costs  across  sectors. 
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structural  models,  even  for  the  AR(2)s  —  which  have  39  parameters  more  than  the  nonlinear 
structural  models.  But  comparing  the  likelihoods  is  not  strictly  correct,  since  the  linear 
models  (especially  the  AR(2)s)  are  not  nested  in  our  structural  models.  Instead,  we  use 
the  test  for  non-nested  models  developed  by  Vuong  (1989)  and  Rivers  and  Vuong  (1991). 
Let  /[  and  I2  denote  the  maximum  value  attained  by  the  log-likelihood  for  models  1 
and  2,  T  denotes  the  number  of  periods  considered  when  calculating  the  likelihood  (42,  in 
our  case),  rti  and  722  denote  the  number  of  parameters  for  models  1  and  2,  respectively, 
and  S  denotes  the  Newey-West  estimate  for  the  variance  of  the  time  series  of  likelihood 
differences.  The  null  hypothesis  is  that  both  models  are  vT-asymptotically  equivalent;^"* 
should  this  be  the  case  we  have  that  the  test  statistic 

(20)  ^^^^  ^  h  -  I2  -  Ijn,  -  n,)\ogiT) 


VST 

has  a  Standard  Normal  distribution.    Positive  values  of  Vi^2  indicate  evidence  in  favor  of 
model  1;  negative  values  evidence  in  favor  of  model  2.^^ 

Table  2  presents  Vuong's  statistics  and  the  p- values  for  the  test  that  both  models  (lin- 
ear and  nonlinear)  are  equally  close  to  the  'true'  model,  against  the  alternative  that  the 
(nonlinear)  structural  model  is  better.  It  is  apparent  that  the  null  hypothesis  is  rejected  in 
favor  of  the  alternative  even  at  very  low  significance  levels. 

One  possible  reason  for  the  bad  relative  performance  of  linear  models  is  that  we  have 
assumed  that  aggregate  shocks  are  Normally  distributed.  Sectoral  investment  rates,  on 
the  other  hand,  are  clearly  not  Normal;  the  skewness  and  (excess)  kurtosis  coefficient  of 
(standardized,  for  every  sector)  investment  rates  are  0.61  and  0.74  for  equipment  and  0.76 
and  0.87  for  structures.  Obviously,  linear  models  with  Normal  errors  cannot  account  for 
these  departures  from  Normality.  The  innovations  generated  by  the  best  partial  adjustment 
model  and  best  second-order  autoregressive  models  also  depart  from  Normal,  as  can  be  seen 
in  table  3:  Their  skewness  and  kurtosis  coefficients  are  0.49  and  1.15  for  equipment  and  0.95 


'M.e.,  limr-oc  VT(/,  -  h)  =  0. 
Note  that:   First,  the  numerator  of  (20)  contains  a  penalty  term  -  the  Bayesian  Information  Criterion, 
BIC  -  that  corrects  for  differences  in  degrees  of  freedom  between  both  models.   Second,  denoting  the  sum 
of  sectoral  likelihoods  for  model  i  at  time  (  by  /,,,  i  =  1,2,  and  d,  =  h,  —  lit,  we  have  that  the  S  in  the 
denominator  of  Vi,2  is; 

5(r,?)  =  7o  +  2^ 


?  +  l 


7j. 


where  7^  denotes  the  sample  autocorrelation  of  order  j  of  the  d,  time  series.  Since  in  all  cases  S{T,  q)  does 
not  vary  much  for  values  of  q  larger  than  7,  we  choose  q  =  &  when  calculating  S  in  (20). 
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Table  2:  Nonnested  Models  Test:  Normal  Shocks 


Eq 

uipment 

St 

ructures 

PAM 

AR(2)-UNC 

PAM 

AR(2)-UNC 

Vuong  statistic 

2.61 

4.25 

2.92 

4.07 

p- value 

0.0045 

<  0.0001 

0.0018 

<  0.0001 

NOTES:  Vuong  statistic  calculated  as  in  (20).  All  test  statistics  compare  the  structural 

model  estimated  in  Table  1  with  the  model  in  the  table.  If  both  models  are  'equally 

good',  the  asymptotic  distribution  of  the  statistic  is  Standard  Normal.  Large  positive 

values  provide  evidence  in  favor  of  the  structural  model. 


and  1.88  for  structures  in  the  partial  adjustment  case,  and  0.38  and  1.00  for  equipment, 
and  0.86  and  1.65  for  structures  in  the  AR(2)  case.  All  these  numbers  are  significantly 
different  from  zero  (the  Normal  case)  at  the  0.001  level  (estimated  via  bootstrap).  The 
last  two  rows  show  that  the  increasing  hazard  model  generates  innovations  that  are  closer 
to  Normal  than  its  linear  and  constant  hazard  counterparts.  The  estimated  skewness  and 
kurtosis  coefficients  are  considerably  smaller,  and  both  skewness  coefficients  do  not  depart 
significantly  (at  the  0.05  level)  from  their  values  under  the  Normality  assumption.  The 
increasing  hazard  model  does  not  need  to  introduce  nearly  as  much  skewness  and  kurtosis 
in  aggregate  shocks  to  account  for  investment  behavior. 

Normality  is  the  natural  assumption  when  aggregate  shocks  are  conceived  as  the  sum  of 
a  wide  variety  of  small  shocks  with  limited  dependence  (by  the  Central  Limit  Theorem).  In 
spite  of  this,  we  momentarily  relax  this  assumption  in  order  to  consider  shocks  which  admit 
skewness  and  kurtosis  properties  similar  to  those  observed  in  sectoral  investment/capital 
ratios.  For  this  purpose,  we  generalized  the  distribution  of  the  residual  to  consider  convex 
combinations  of  Normal  and  log-Normal  distributions.  The  log-Normal  component  did  not 
add  significantly  to  the  structural  model,  while  the  linear  models  assigned  most  of  the 
weight  to  the  log-Normal  component.  For  this  reason  we  compare  the  structural  models 
with  NormaJ  shocks  versus  the  linear  models  with  log-Normal  shocks  in  table  4.  Although 
the  likelihood  in  the  linear  model  improves  substantially  with  this  modification,  table  4 
shows  that  the  test  for  non-nested  models  still  favor  the  (nonlinear)  structural  models  by 
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Table  3:  Skewness  and  kurtosis  for  innovations* 


Model 

Equi 

pment 

Struct 

ures 

Skewness 

Kurtosis 

Skewness 

Kurtosis 

Part.  Adj. 

0.49 

1.15 

0.95 

1.88 

(0.13) 

(0.36) 

(0.15) 

(0.65) 

AR(2) 

0.38 

1.00 

0.86 

1.65 

(0.12) 

(0.31) 

(0.14) 

(0.60) 

Structural 

-0.04 

0.65 

0.17 

0.79 

(0.11) 

(0.20) 

(0.13) 

(0.34) 

*  Shown  in  parenthesis:  standard  deviation,  obtained  via  bootstrap. 

a  wide  margin.  In  fact,  the  reduction  in  the  denominator  due  to  the  increased  precision  of 
the  test  (the  likelihoods  become  more  correlated  across  models)  more  than  outweighs  the 
increase  in  the  likelihood  of  the  linear  model. 

Out-of-sample  criteria. 

Next,  we  evaluated  the  out-of-sample  forecasting  performance  of  our  model.  For  this 
purpose  we  reestimated  the  nonlinear  structural  and  AR(2)  models  dropping  ten  percent  of 
our  observations  (the  last  five  years  for  each  of  our  21  sectors),  and  generated  the  one-step- 
ahead  forecast  distributions,  for  each  sector  and  year  out  of  the  sample.  We  only  evaluated 
the  model's  performance  relative  to  that  of  an  AR(2)  using  a  standard  Mean-Square-Error 
criterion,  although  this  reduces  the  potential  forecasting  edge  of  nonlinear  models. ■^^  We 
postpone  further  discussion  of  forecasts'  higher  moments  until  the  conclusion. 

Table  5  reports,  for  each  investment  type,  the  average  (across  sectors)  MSE  for  each 
year  (first  two  columns),  and  the  percentage  increase  in  MSE  generated  by  the  AR(2) 
model  over  the  nonlinear  one  (third  column).  Except  for  1988,  the  structural  nonlinear 
model  systematically  outperforms  the  AR(2)  representation.  This  is  particularly  true  for 
structures,  where  the  gain  is  over  35  percent  during  four  out  of  the  five  years  for  which  we 
generated  out-of-sample  forecasts. 

This  favorable  evidence  for  the  nonlinear  model  is  reinforced  by  table  6.    It  reports, 


See  Ramsey  (1996)  for  arguments  on  the  bias  against  nonlinear  models  inherent  in  MSE  comparisons. 
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Table  4:  Nonnested  Models  Test:  Log-Normal/Normal  Shocks 


test  (Normal) 


p-value 


Equipment 
PAM      AR(2)-UNC 


2.42 


7.97 


0.0078        <  0.0001 


Structures 
PAM      AR(2)-UNC 


3.57 


4.53 


0.0002        <  0.0001 


NOTES:  Vuong  statistic  calculated  as  in  (20).  All  test  statistics  compare  the  structural 

model  estimated  in  Table  1  with  the  model  in  the  table.  If  both  models  are  'equally 

good',  the  asymptotic  distribution  of  the  statistic  is  Standard  Normal.  Large  positive 

values  provide  evidence  in  favor  of  the  structural  model. 


Table  5:  Average  MSE  for  1-Step-Ahead  Forecasts:   1988-92 


Equipment 

Structures 

Year 

AR(2) 

MODEL 

ln((l)/(2)) 

AR(2) 

MODEL 

ln((l)/(2)) 

88 

0.591 

0.640 

-0.075 

0.306 

0.340 

-0.092 

89 

0.240 

0.230 

0.038 

0.371 

0.220 

0.502 

90 

0.308 

0.270 

0.131 

0.207 

0.150 

0.359 

91 

0.566 

0.540 

0.044 

0.507 

0.350 

0.371 

92 

0.399 

0.330 

0.194 

0.518 

0.340 

0.425 

The  parameters  were  estimated  using  data  up  to  1987.  MSEs  are  multiplied  by  10 
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for  each  investment  type,  the  average  (across  years)  MSE  for  each  sector  for  the  AR(2) 
and  nonlinear  models  (first  two  columns),  and  the  percentage  increase  in  MSE  generated 
by  the  AR(2)  model  over  the  nonlinear  one  (third  column).  At  the  bottom  of  the  table, 
we  report  the  percentage  increase  in  average  (across  sectors  and  time)  MSE  generated  by 
the  AR(2)  model  over  the  nonlinear  one,  as  well  as  the  median  (across  sectors)  increase. 
The  sectoral  dimension  is  one  along  which  we  would  have  expected  the  nonlinear  model  to 
do  relatively  poorly,  since  in  order  to  gain  statistical  power  for  the  nonlinearities  we  were 
forced  to  impose  the  same  distribution  of  adjustment  costs  across  sectors,  which  is  not  likely 
to  hold  too  closely  in  the  data.  The  AR(2),  on  the  other  hand,  has  no  constraints  across 
sectors.  However,  table  6  shows  that  even  under  this  unfavorable  metric  the  structural 
model  outperforms  the  unconstrained  AR(2)  representation.  Again,  this  is  particularly 
true  for  structures,  where  the  gain  in  terms  of  MSE  is  over  15  percent  for  the  median 
sector,  and  above  30  percent  for  the  average  MSE. 

3.6        Dynamic  implications  of  increasing  hazard  models. 

Perhaps  the  main  distinctive  feature  of  the  model  we  have  estimated,  compared  with  its 
linear  counterparts  or  a  constant  hazard  model,  is  that  not  only  average  investment  by  those 
that  are  investing  but  also  the  number  of  firms  that  choose  to  invest  at  any  point  in  time 
fluctuates  over  the  business  cycle.  This  is  a  realistic  feature  according  to  the  establishment- 
level  evidence  in  Doms  and  Dunne  (1993).  Among  many  interesting  facts,  they  show  that 
the  number  of  plants  going  through  their  primary  investment  spikes  (i.e.,  the  single  year 
with  the  largest  investment  for  the  establishment),  rather  than  the  average  size  of  these 
spikes,  tracks  closely  aggregate  manufacturing  investment  over  time. 

In  terms  of  our  model,  this  flexibility  in  the  number  of  firms  investing  implies  that 
the  extent  of  the  response  of  aggregate  investment  to  aggregate  shocks  fluctuates  over  the 
business  cycle.  Figure  5  depicts  the  paths  of  the  median  (across  sectors)  derivatives  of 
aggregate  investment  with  respect  to  aggregate  shocks  for  equipment  and  structures.'^^  It 
is  apparent  that  this  "index  of  responsiveness"  fluctuates  widely  over  the  sample.  Moreover, 
it  is  strongly  procyclical:  Its  correlations  with  aggregate  shocks  and  aggregate  investment 
are,  respectively,  0.79  and  0.89  for  equipment,  and  0.72  and  0.39  for  structures.^* 

There  are  traces  of  the  cyclical  features  of  our  nonlinear  model  in  our  out-of-sample 


Evaluated  at  vt- 

28^ 


The  correlations  with  lagged  investment/capital  ratios  are  0.60  and  0.13,  for  equipment  and  structures, 
respectively. 
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Table  6:  Sectorial  Average  MSE  for  1-Step-Ahead  Forecasts:  1988-92 


Equipment 

Structures 

Sector 

AR(2) 

MODEL 

ln((l)/(2)) 

AR(2) 

MODEL 

ln((l)/(2)) 

20 

0.591 

0.780 

-0.425 

0.742 

0.150 

1.571 

21 

0.085 

0.084 

0.003 

0.117 

0.150 

-0.233 

22 

0.343 

0.310 

0.091 

0.221 

0.120 

0.612 

23 

0.243 

0.110 

0.781 

0.278 

0.094 

1.088 

24 

0.160 

0.240 

-0.405 

0.342 

0.250 

0.333 

25 

0.083 

0.092 

-0.105 

0.560 

0.260 

0.760 

26 

0.034 

0.068 

-0.707 

0.633 

0.400 

0.456 

27 

1.620 

1.710 

-0.055 

0.386 

0.720 

-0.619 

28 

0.129 

0.180 

-0.341 

0.313 

0.400 

-0.247 

29 

0.098 

0.100 

-0.036 

0.628 

0.570 

0.097 

30 

0.272 

0.260 

0.035 

0.417 

0.088 

1.560 

31 

0.082 

0.089 

-0.097 

0.063 

0.054 

0.159 

32 

1.433 

1.000 

0.357 

1.483 

1.000 

0.392 

33 

0.078 

0.076 

0.023 

0.017 

0.024 

-0.352 

34 

1.739 

1.410 

0.211 

0.201 

0.270 

-0.288 

35 

0.551 

0.730 

-0.276 

0.353 

0.400 

-0.127 

36 

0.506 

0.420 

0.181 

0.328 

0.390 

-0.170 

37-1 

0.310 

0.220 

0.338 

0.099 

0.150 

-0.411 

37 

0.241 

0.270 

-0.124 

0.494 

0.075 

1.888 

38 

0.221 

0.200 

0.120 

0.184 

0.210 

-0.124 

39 

0.099 

0.072 

0.319 

0.162 

0.100 

0.467 

ln(S,(l)/S.(2)) 

- 

- 

0.047 

- 

- 

0.312 

Median 

0.241 

0.220 

0.003 

0.328 

0.210 

0.159 

The  parameters  were  estimated  using  data  up  to  1987.  MSEs  are  multiplied  by  10 
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Table  7:  Standardized  Difference  in  Absolute  Value  of  Forecast  Errors  (AR2-ModeI) 


Rel.  to  Median 

Sort  by  v 

Equipment 

Sort  by  y     Sort  by  ^ 

Sort  by  v 

Structures 
Sort  by  y 

Sort  bv  ^ 

dv 

Below 
Above 

-0.286 
0.255 

-0.189           -0.043 
0.185             0.155 

-0.056 
0.158 

0.100 
-0.033 

-0.013 
0.092 

forecasts  as  well.  The  MSE  gains  of  our  model  over  the  linear  AR(2)  is  particularly  pro- 
nounced during  periods  of  high  activity.  To  show  this,  we  proceed  in  three  steps:  First,  we 
construct,  for  each  sector,  a  standardized  series  of  the  difference  of  the  absolute  values  of 
the  forecast  error  of  the  AR(2)  and  the  structural  model,  A|e-^|: 


\Jar 

^I'^id  — 


Jr., 


cr. 


A./ 


where  e^  denotes  the  forecast  errors  (ar  and  nl  stand  for  AR(2)  and  nonlinear  model, 
respectively),  and  a^j  denotes  the  standard  deviation  of  the  difference  in  the  absolute 
value  of  the  forecast  errors.  Second,  for  each  sector,  we  sort  these  standardized  series  by 
one  of  the  following  indicators  of  activity:  the  sectoral  aggregate  shock,  u,  the  level  of  the 
sectoral  investment/capital  ratio,  y,  and  the  sectoral  index  of  responsiveness,  dy/dv.  And 
third,  we  average  across  sectors  the  sorted  standardized  series. 

Table  7  reports  these  averages  for  times  when  the  sorting  variable  was  below  its  median 
(below)  and  above  its  median  (above).  With  only  one  exception,  all  the  entries  suggest 
that  a  substantial  fraction  of  the  better  performance  of  the  nonlinear  model  comes  from 
periods  when  the  sectoral  indicators  of  activity  (shock,  investment  and  sensitivity  index) 
are  high.  For  example,  we  find  that  periods  when  aggregate  shocks  are  below  their  median, 
achieve  a  forecasting-performance  improvement  which  is  0.286  standard  deviations  lower 
than  the  average  MSE  gain  for  equipment,  while  it  is  0.056  standard  deviations  lower  for 
structures.  Conversely,  when  aggregate  shocks  are  above  their  median,  the  forecasting- 
performance  improvement  is  0.255  standard  deviations  higher  than  the  average  MSE  gain 
for  equipment,  while  it  is  0.158  standard  deviations  higher  for  structures. 
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4     Final  Remarks 

In  this  paper  we  derived  and  estimated  a  time  series  model  of  sectoral  investment  that 
builds  from  the  realistic  observation  that  lumpy  adjustments  play  an  important  role  in  firms' 
investment  behavior;  but  that  allows  for  the  empirically  appealing  feature  that  adjustments 
do  not  need  to  be  of  the  same  size  across  adjusting  firms  and  for  a  firm  over  time. 

Using  a  nonlinear  aggregate  time  series  procedure,  we  estimated  the  distributions  of 
fixed  adjustment  costs  faced  by  firms.  The  adjustment  hazards  implied  by  our  estimates 
are  nonconstant:  they  leave  a  significative  range  of  inaction,  and  increase  sharply  there- 
after. Depending  upon  the  history  of  shocks,  the  estimated  hazards  have  the  potential 
to  magnify  or  dampen  the  response  of  investment  to  aggregate  shocks.  The  passivity  of 
normal  times  is,  occasionally,  more  than  offset  by  the  brisk  response  to  large  — current  or 
accumulated —  shocks.  These  nonlinearities  clearly  improve  the  aggregate  performance  of 
dynamic  investment  equations. 

Both  the  microeconomjc  as  well  as  the  aggregate  implications  of  the  estimated  model  are 
largely  consistent  with  the  establishment  level  evidence  presented  by  Caballero,  Engel  and 
Haltiwanger  (1995)  for  U.S.  manufacturing  (for  the  1972-88  period).  They  found  evidence 
of  an  increasing  hazard  for  the  range  of  disequilibria  where  establishments  spent  most  of 
their  time.  More  importantly,  they  also  found  an  important  role  for  the  cross  sectional 
density  of  capital  imbalances  in  explaining  changes  in  the  marginal  response  of  aggregate 
investment  to  aggregate  shocks. 

In  the  processes  of  assessing  the  contribution  of  the  model,  we  found  an  important 
forecasting  gsi]n  over  simple  linear  models.  In  almost  every  year  and  sector,  and  particularly 
so  for  structures,  the  nonlinear  model  reduced  the  Mean-Squared-Error  by  a  substantial 
amount. 

Beyond  the  current  paper,  there  are  three  extensions  and  robustness  issues  worth  men- 
tioning at  closing:  First,  the  nonlinear  model  has  non-trivial  implications  for  forecasts' 
higher  moments.  Our  preliminary  exploration  of  this  issue  reveals  that  the  standard  de- 
viation, skewness  and  excess  kurtosis  of  investment/capital  ratios'  forecasts,  are  highly 
correlated  with  the  business  cycle.  These  relations  hint  at  a  promising  structural  avenue 
to  explore  movements  in  forecasts'  higher  moments. ■^^ 

Second,  in  the  working  paper  version  of  this  paper  (Caballero  and  Engel  1994)  we 


A  good  complement  for  non-structural  ARCH  models,  for  example. 
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allowed  for  serial  correlation  in  the  rate  of  growth  of  aggregate  frictionless  capital  (the  u.t's) 
and  found  very  little  of  it,  which  provides  support  for  our  i.i.d.  assumption  in  the  theory 
section.  Interestingly,  the  linear  (PAM)  model  we  estimated  left  plenty  of  unexplained  serial 
correlation,  especially  for  equipment  investment. 

Finally,  and  also  reported  in  Caballero  and  Engel  (1994),  we  extended  the  theory  and 
empirical  sections  to  acknowledge  the  existence  of  continuous  "maintenance"  investment, 
which  does  not  require  paying  sizeable  adjustment  costs.  We  found  that  while  such  an  al- 
lowance was  important  to  obtain  a  more  realistic  distribution  of  observed  changes  in  capital 
and  average  investment  rates  (in  particular,  small  changes  account  for  a  significant  frac- 
tion of  microeconomic  investment  changes),  it  did  not  diminish  the  role  of  microeconomic 
lumpiness  in  accounting  for  the  dynamic  aspects  of  aggregate  investment. 
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Appendices 


A.:  Dynamic  Optimization 

In  this  appendix  we  study  the  firm's  stochastic  dynamic  optimization  problem.  In 
section  1  we  show  existence  and  uniqueness  of  the  solution  to  the  firm's  Bellman  equation. 
In  section  2  we  study  the  main  properties  of  the  optimal  policy  function. 

1      Existence  and  uniqueness 

From  the  main  text  it  follows  that  the  Bellman  equation  -  for  the  firm's  value  function 
normalized  by  frictionless  capital  -  is  given  by:"^*^ 

v'{z,u;)  =  m&x(Tr{z  +  i)-L^^e'^'{i^  0}  +  i'  f  f  e'^'v'iz  +  i  +  Az,ij')dF{Az)dGioj')\  , 

with  {i  7^  0}  denoting  an  indicator  function  that  takes  the  value  1  when  the  firm  adjusts 
its  capital  stock  and  zero  otherwise. 

The  operator  associated  with  the  above  equation  is  not  bounded  from  below.  For  this 
reason  we  add  a  term  to  v'{z,iv)  which  does  not  depend  on  the  choice  variable  and  therefore 
does  not  affect  the  firm's  optimal  choice,  but  which  bounds  the  corresponding  operator: 

(21)  viz,^)  =  v'{z,u)+^ue^'. 

Substituting  (21)  into  the  expression  above,  using  the  expression  for  n{z)  derived  in 
section  2  of  the  main  text,  and  performing  some  straightforward  (but  tedious)  calculations, 
leads  to  the  Bellman  equation  for  v{z,i^'): 

(22)  v{z,u)  =  max{ci(Lj)e^^  -  C2e'  +  'A  //  e-^'v{z  +  Az,u')dF{Az)dG{u'),    ' 


maXi 

with 


u    cae^"  -  C2e"  +  ^  I f  e'^'viu  +  Az,io')dF{Az)dG{J) 


}, 


c,{u)     =     ^  (l  +  u- -  ^E  [e^^^]  ;.^)  , 


All  terms  that  are  not  explicitly  defined  in  what  follows  were  defined  in  section  2  of  the  main  text. 
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C2       =      e/?, 

C3     =     ^  (l  -  VE  p^^] /i.)  , 

where  /^i^  denotes  the  mean  of  the  distribution  of  the  adjustment  cost  factor  G{io). 

To  show  that  (22)  has  a  unique  solution  we  make  the  following  three  assumptions  and 
prove  the  following  two  lemmas.  The  assumptions  hold  throughout  the  remainder  of  this 
appendix. 

Assumption  1  The  adjustment  cost  factor,  u),  is  bounded  from  above  by  uj  <  +00  and 
from  below  by  0. 

Assumption  2  VE  [e-'^'l  =  jp  J e-'^'dF{Az)  <  l.^' 

Assumption  3  ^pfi^^E  \e^^^\  =  TpfJ-u  f  ^^^'dF{Az)  <  1,  where  11^  denotes  the  mean  of  the 
distribution  of  u. 

Lemma  Al 

Let  J(z)  =  k-^e^'  -  k2e\  with  0  <  /3  <  1  and  /c] ,  A:2  >  0.  Denote  z^  =  \og{ki0/k2)/{l  - 
P).  Then  /(-)  is  increasing  for  z  <  Zf,j,  decreasing  for  z  >  z,\},  and  attains  its  maximum 
value,  which  is  equal  to  (1  -  P)k\^^^~^\p/k2)'^^^^-'^K  at  z  =  z^. 

Proof  Elementary  calculus.     | 

Lemma  A2 

Consider  the  operator  T  defined  by  posing  {Tv){z,uj)  equal  to  the  right  hand  side  of  (22). 
This  operator  is  defined  on  the  set  B  of  all  real-valued,  bounded,  continuous  functions  with 
domain  M  x  [0,u;]. 

Then  T:  (i)  preserves  boundedness;  (ii)  preserves  continuity;  and  (iii)  satisfies  Black- 
well's  conditions. 

Proof 

(i)  Consider  u  ^  B,  bounded  from  below  by  u  and  from  above  by  u.   Then  (Ttz)(z,cj)  is 


In  the  main  text  we  assume  that  Az  follows  a  Normal  distribution,  with  mean  n  and  variance  a  . 
Then  this  condition  is  equivalent  to  fi  >  ^a^  +  log(V')  which  for  r  -f  6  <<  1  corresponds  approximately  to 
^i  >  ^cr  —  T  —  b.  Thus,  for  the  set  of  parameters  we  use  in  the  empirical  section,  a  sufficient  condition  is 
that  /J  >  —0.15  for  equipment  and  /j  >  —0.10  for  structures.  Both  conditions  can  be  expected  to  hold. 
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bounded  from  above,  since: 


< 

V'E 

e-^^ 

u  +  max  |ci(w)e^^  - 

C2e' 

max 

w 

< 

V-E 

e-^-' 

tl  +  max  fc,(u;)e^"'  - 

-  C2e^ 

< 

xPE 

'e-^'~ 

u  +  ^{l-P)[l+uJ 

-  VE 

g^Az' 

cae^ 


C2e 


1} 


1/(1-/3) 


where  the  second  inequality  follows  from  the  fact  that,  for  all  w  >  0,  C3  <  Ci{lo),  and  the 
last  inequality  from  Lemma  Al. 

That  (Tu)(2,cj)  is  bounded  from  below  follows  from: 


u  +  max  I  c\{uj) 
u  +  max 


,Pz 


€26   ,  max   036 


{Tu){z, ij)     >     ^E[e-^^ 
>     i'E  [e-^' 

^-^-'1  H  +  ^1  - /?)  (1  -  ^E  p^''] /z.) 


oP^ 


c.e-]} 


cae^'^  -  C2e' 


V'E  e 


1/(1-^) 


where  we  used  03  <  ci{u>)  in  the  second  step,  and  Lemma  Al  in  the  last  step. 

(ii)  To  show  that  the  function  {Tu)  is  continuous  for  u  G  ;B,  we  note  that  from  (22)  it 
follows  that  {Tu){z,Lo)  is  the  maximum  of  two  functions;  the  first  inherits  continuity  from 
u(z,u>)  and  the  second  is  constant.  It  follows  that  (Tu)  is  continuous. 

(iii)  To  show  that  T  satisfies  Blackwell's  conditions,  we  first  note  that  if  Ui,U2  G  B,  and 
u\(z.u})  <  U2{z,u))  for  aU  z  and  u),  then  the  expected  value  of  any  positive  random  variable, 
in  particular  e~^~,  preserves  the  above  inequality.  Thus: 

{Tu,){z,oj)<{Tu2){z,u). 

A  straightforward  calculation  shows  that,  for  any  u  £  B  and  any  constant  a: 


{T[u  +  a])(;,u;)  =  {Tu)iz,u)  +  0E  [e-^^]  a. 


The  second  Blackwell  condition  follows  from  Assumption  2.     | 

Proposition  Al 

Equation  (22)  has  exactly  one  solution  (and  this  solution  belongs  to  B). 

Proof 
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It  follows  from  Lemma  A2  that  T  defines  a  contraction  mapping  on  the  metric  space  B 
(normed  with  the  sup-norm).  The  modulus  of  the  contraction  mapping  is  i/'E  e~^^  . 
Existence  and  uniqueness  of  a  solution  to  (22)  now  follows  from  the  Continuous  Mapping 
Theorem  (see,  e.g.,  Theorem  3.2  in  Lucas  and  Stokey  (1989)).     | 

2      Properties  of  the  optimal  policy 

We  define  the  following  functions  related  to  the  solution  of  the  Bellman  equation,  v{z,u>), 
considered  in  the  preceding  section: 

(23)  I{z)     =     i)  f  f  e-^'v{z+  Az,oj)dF{Az)dG{u), 

(24)  Jiz)     =     C3e^' -  C2e' +  I{z). 

Lemma  A3 

The  function  J{z)  is  bounded  from  above,  i.e.,  sup,  J{z)  is  finite.  We  denote  this  supremum 

by   Jmax- 

Proof 

Since  v{z,uj)  satisfies  Bellman's  equation,  we  have: 

(25)  v{z,u>)  =  max{^we^*  +  J{z) ,  max  J(u)}. 

If  max,  J(;)  were  not  finite,  we  would  have  that  v[z,ijj)  is  not  bounded,  contradicting 
Proposition  Al.  It  follows  that  sup,  J(c)  is  finite.     | 

Lemma  A4 

The  function  J{z)  satisfies: 

(26)  lim    J{z)     =     J^ax^E  [e-^^l  , 

Z——00  I  J 

(27)  lim    J{z)e-'     =     -C2. 

2— >-|-CX> 

Proof 

From  (25)  it  follows  that: 

lim     v{z,uj)  =  Jmax, 
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which,  from  (23),  implies  that 

lim      I{z)  =  JmaxV'E 


e-^^ 


Expression  (26)  now  follows  from  (24). 

Expression  (27)  holds  because  of  (24)  and  the  fact  that  I{z)  inherits  boundedness  from 
v{z,uj).     I 

The  next  result  establishes  the  existence  of  a  curve  in  (z,w)-space  partitioning  this 
space  into  two  regions;  one  where  firms  adjust  their  capital  stock  and  another  where  they 
remain  inactive. 

Proposition  A2 

Define: 

(28)  n{z)  =  r'iJm..-J{z))e-f'\ 

Then  firms  adjust  when  their  current  adjustment  cost  factor,  w,  is  smaller  than  fi(r),  and 
remain  inactive  when  w  >  ^{z). 

Proof 

By  equating  both  terms  on  the  right  hand  side  of  (25)  we  obtain: 

(29)  ^n(z)^{J^,,-J{z))e-^\ 

The  inequalities  that  hold  for  u>  larger  and  smaller  than  ft{z)  follow  trivially.     | 

Proposition  A3 

The  function  fl{z)  is  analytic  on  the  real  line,  and  therefore  has  derivatives  of  all  order. 

Proof 

From  the  definition  of  fl{z)  and  J{z)  (see  (28)  and  (24))  we  have  that  it  suffices  to  show 
that  I{z)  is  analytic.  To  do  this,  we  note  that  I{z)  may  be  written  as  the  convolution  of  a 
normal  density  and  a  continuous,  bounded  function: 


/(z)  =  i^  f  K{z  +  Az)dF{Az), 


When  uj  =  Q{z)  firms  are  indifferent  between  adjusting  and  not  adjusting. 
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with 

K{z)     =      /  v{z,u))dG(u>), 

sl- 
ip    =     -^le  2     >^  ^ 

and  dF{Az)  a  normal  density  with  mean  Jl  =  /i  -  a"^  and  variance  cr^  (where  dF{Az)  is 
normal  with  mean  /i  and  variance  <7^). 

That  the  convolution  of  a  norma!  density  and  an  integrable  (in  particular,  a  bounded, 
continuous)  function  is  analytic,  follows  from  a  well  known  property  of  the  exponential 
family  of  distributions  (see  Theorem  9  on  p.  59  in  Lehmann  (1986)).     | 

Proposition  A4 

As  z  tends  to  -oo:'*'^ 


e-'\ 


(30)  ^fi(z)~  J„,,,{l-0E[e-^'']} 

and  as  z  tends  to  oc: 

(31)  ^fi(.-)  ~  C2e('-'3)^ 

It  follows  that: 

(32)  lim    n{z)  =  00. 

|j|  — oo 

PliOOF 

Expression  (30)  follows  from  (28)  and  Lemma  A4.  Expression  (31)  follows  from  (27).  Then 
lini|.|_,^  n(r)  =  CO  then  follows  from  (30),  (31),  Assumption  2  and  the  fact  that  /3  <  1.     | 

Proposition  A5 

The  set  C  of  r  £  iR  such  that  J{z)  =  Jmax  is  a  non-empty  set  with  a  finite  number  of 
points. 

Proof  Continuity  of  J{z)  (it  is  analytic,  see  the  proof  of  Proposition  A3)  and  Lemma 
A4  combined  with  Assumption  2  ensure  the  existence  of  a  bounded,  closed  set  Q  within 
which  J{z)  attains  its  maximum.  Continuity  of  J{z)  on  the  compact  set  Q  ensures  that  the 
maximum  is  indeed  attained  and  therefore  C  is  non-empty.  Finally,  since  J{z)  is  analytic 
we  have  that  its  maxima  are  isolated,  thus  showing  that  C  contains  a  finite  number  of 


'We  write  a(z)  ~  6(2)  as  :  tends  to  c  if  lim^^c  Inj  =  1- 
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elements.     | 

Proposition  A6 

When  adjusting  its  capital  stock,  a  firm's  optimal  choice  of  z  is  any  element  in  C.  Thus  its 
disequilibrium  after  adjusting  does  not  depend  on  its  disequilibrium  before  adjusting. 

Proof 

The  result  follows  from  the  fact  that  when  the  maximum  between  both  terms  in  the  right 
hand  side  expression  of  (25)  is  attained  at  the  second  term,  this  expression  does  not  depend 
on  z.     I 

All  calculations  of  ^(z)  performed  while  estimating  the  distribution  of  the  adjustment 
cost  factor  (see  section  3.3  in  the  main  text  and  Appendix  C)  led  to  a  set  C  with  a  unique 
element,  c,  and  a  function  n(~)  that  is  decreasing  to  the  left  of  2  =  c  and  increasing  to 
the  right  of  2  =  c,  therefore  implying  an  optimal  policy  of  the  {L,c,U)  type.  Yet  we  have 
been  unable  to  show  formally  that  C  has  one  element  (i.e.  a  unique  return  point),  and  also 
have  not  shown  formally  that,  conditional  on  u,  the  firm's  optimal  policy  is  of  the  {L,c,U) 
type.  As  the  following  proposition  shows,  however,  we  can  prove  that  the  latter  holds  in  a 
neighborhood  of  a  return  point. 

Proposition  A7 

For  (cj,  z)  in  a  neighborhood  of  (0,  c),  the  optimal  policy  is  of  the  {L,c,  U)  type. 

Proof 

From  Proposition  A5  it  follows  that  there  exists  a  neighborhood  V  of :;  =  c  such  that  J{z) 
is  decreasing  to  the  left  of  c  =  c  and  increasing  to  the  right  of  ;  =  c.  It  then  follows  from 
equation  (28)  that  ^{z)  is  decreasing  to  the  left  of  r  =  c  and  increasing  to  the  right  of 
z  =  c.  Thus,  for  all  ui  <  max-gv'('^ma.x  -  J{~))p-~^'  and  r  £  V  we  have  that  the  optimal 
policy,  conditional  on  the  current  adjustment  cost  factor,  is  of  the  {L,c,U)  type.     | 

B.:  Aggregation 

This  appendix  is  divided  into  three  sections.  In  Section  1  we  establish  the  exact  expres- 
sion for  aggregate  investment  and  present  the  results  of  simulations  to  assess  the  quality 
of  the  approximation  we  use.  In  Section  2  we  study  the  main  properties  of  the  adjustment 
hazard.  In  Section  3  we  characterize  the  average  cross-section  of  firm  deviations. 

The  following  operators,  defined  on  the  set  of  probability  measures  on  the  real  line,  are 
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used  in  sections  1  and  3  of  this  appendix  and  in  Appendix  C. 

•  Sectoral  (aggregate)  shock,  A{v):  shifts  a  cross-section  by  v. 

•  Adjustment  shock,  HiO):  applies  adjustments  determined  by  the  adjustment  func- 
tion characterized  by  the  parameter  vector  6. 

•  Idiosyncratic  shock,  I{o^)-  convolves  the  probability  measure  with  a  Normal  den- 
sity with  zero  mean  and  variance  cr^. 

•  Full  cycle  of  shocks,  T{v  +  S,9,a):  equal  to  the  combination  of  the  three  shocks 
defined  above,'^''  i.e.,  to  I{(7)H{d)A{v  +  6).  Equation  (17)  in  the  main  text  gives  an 
explicit  expression  for  the  cross-section  that  results  from  applying  T  to  f{x,t  —  1). 

•  Aggregate  investment  functional,  y-.  Assigns  to  a  cross-section  the  average  in- 
vestment rate  that  results  after  adjustments  take  place  (see  equation  (12)  in  the  main 
text). 

1      Aggregate  Investment 

In  this  section  we  derive  the  exact  expression  for  aggregate  investment,  of  which  equa- 
tion (12)  is  an  approximation.  We  also  assess  the  quality  of  this  approximation. 

Lemma  Bl 

We  introduce  the  following  notation: 

•  x,,i:  disequilibrium  immediately  before  period  t  adjustment  of  firm  i. 

•  r,  (!  number  of  periods,  as  of  time  t,  since  firm  i  last  adjusted.''^ 


/,  (!  last  time  firm  i  adjusted  (equal  to  i  —  r, 


•  A:'(_^:  desired  level  of  (log)  capital  of  firm  i  immediately  before  its  last  adjustment 
took  place. 

•  ktj^^:  firm  t's  (log)  capital  stock  immediately  after  the  last  time  it  adjusted.  Note 
that  /:,  (_  J  —  k'l^  ^  +  c  and  that,  as  changes  in  capital  since  time  /,,(  have  only  reflected 
depreciation,  this  quantity  completely  determines  the  current  capital  stock. 

Incorporating  depreciation. 
^^Possible  values  are:  1,2,3 
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Then,  conditional  on  /,,(,  we  have  that  i,t  and  k^j^  ^  are  independent.  That  is,  conditional 
on  when  the  firm  last  adjusted,  its  current  disequilibrium  and  current  capital  stock  are 
independent. 

Proof  We  have  that  x,t  =  k* ^  -  A;*;  _  and  therefore  depends  on  shocks  (aggregate  and 
idiosyncratic)  that  took  place  during  periods  t  >  In-  On  the  other  hand,  A;,  ;^  ^  depends  only 
on  shocks  that  took  place  at  t  <  /,,(.  Since  shocks  are  i.i.d.,  it  follows  that,  conditional  on 
/,■  (,  both  quantities  are  independent.     | 

Proposition  Bl 

Denote: 

•  Kt{x\r):    average  capital  stock  of  plants  with  disequilibrium  x  at  time  t,  that  last 
adjusted  r  periods  ago. 

•  h'tir)  :  average  capital  stock  of  all  firms  that  last  adjusted  r  periods  ago.  Thus: 

A-,^  =  ^7r,(r)r,(r). 

r 

•  /((r):  average  investment,  at  time  /,  of  those  that  last  adjusted  r  periods  ago.  Thus: 

r 

•  f'{x\r):    cross-section,  at  time  t,  of  plants  that  adjusted  r  periods  ago.    Denoting 
Tt  =  n(e)A(vt  +  S)I{a1)  and  U  =  A{vt  +  <5)I((t2),  we  have  that 

•  7r((r):  fraction  of  plants,  as  of  time  t,  that  last  adjusted  at  r  periods  ago. 

Then: 

it  =  E'^'(^)^(('-)/(e-"  -  l)K{x)ft{x\T)dx. 


Proof 


It"     =     E''tir)It{r) 
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E'^'M/c^ 


(e-^  -  l)A{x)Kt{x\r)f;(x\r)dx 
^     E  Mr)Kt{T)  /(e-^  -  l)Aix)f;ix\r)dx, 

where  we  used  Lemma  Bl  in  the  last  step.     | 

It  follows  that,  to  calculate  the  exact  expression  for  the  aggregate  investment/capital 
ratio,  we  need  to  keep  track  of  a  sufficiently  large  number  of  conditional  cross-sections, 
ft{x\r)  and  the  size  distribution  of  cohorts,  7rt(r).  Computationally,  this  is  substantially 
more  burdensome  than  the  approximation  we  used.  We  show  numerically,  however,  that 
this  appro.xjmation  mostly  affects  nuisance  (secondary)  parameters. 

We  note  that  if  7  denotes  the  estimated  values  of  the  main  parameters  7,''^  then  a  linear 
transformation  of  the  shocks  used  when  calculating  the  likelihood  still  leads  to  the  same 
main  parameters. ~^"  It  follows  that  a  good  measure  of  the  quality  of  the  approximation  we 
use  is  to  determine  the  extent  to  which  the  exact  expression  comes  close  to  our  approx- 
imation when  we  allow  for  a  linear  change  in  the  aggregate  shocks  that  determine  exact 
aggregate  investment. 

To  implement  this  idea  we  consider  50.000  firms  with  initial  capital  stock  equal  to  one 
and  disequilibrium  x  equal  to  zero.  All  firms  belong  to  the  same  sector.  We  simulate  the 
evolution  of  these  firms  during  75  time  periods,  with  parameters  given  by  our  estimated 
structural  model.  We  keep  track  of  the  aggregate  shocks  (denoted  by  Vt)  and  our  approx- 
imation to  aggregate  investment  (denoted  by  yf).  Next  we  re- run  the  whole  process  with 
rescaled  aggregate  shocks  (if,  =  a  +  bvt),  this  time  keeping  track  of  the  exact  expression  for 
aggregate  investment  (y^(a,6)).  We  find  the  values  of  a  and  b  for  which  the  series  yj{a,b) 
is  closest  to  y^ .  To  measure  pro.xjmity  between  both  series  we  consider  two  criteria,  both 
of  them  applied  to  the  last  45  observations  of  both  series: 

^2     ^     ^_MSiyJ(a^b)-yf) 


R2     =     1- 


y^r{yf) 

V^T{yJ(a,b)-yf) 

Var(t/^) 


There  are  three  of  these  parameters  in  both  estimation  approaches  we  use.  In  both  cases  we  have  a  free 
constant.  The  two  remaining  parameters  characterize  the  distribution  of  adjustment  costs  in  the  structural 
case  and  the  adjustment  hazard  in  the  semistructural  case. 

This  follows  from  the  expression  derived  for  the  likelihood  (see  Appendix  C). 
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Table  8:  Assessing  the  approximation  for  aggregate  investment 


Equipment 

Structures 

a 

0.0153 

0.0030 

b 

1.054 

1.080 

R? 

0.9803 

0.9949 

d 

-0.0185 

-0.0045 

b 

1.225 

1.145 

R2 

0.9962 

0.9980 

where  MS(j/()  denotes  the  average  of  the  squares  of  the  corresponding  series  and  Var(2/t) 
its  variance.  These  measures  capture  the  fit  of  a  regression  of  yj  on  y^ ^  differing  in 
whether  they  allow  or  not  for  an  additive  constant  term."^^  Table  8  shows  the  results  of 
our  simulations.  It  is  apparent  that  the  excellent  quality  of  the  fit  we  obtained  justifies 
approximating  aggregate  investment  by  (12). 

2      Adjustment  hazard 

In  this  section  we  study  the  properties  of  the  adjustment  hazard: 

A(x)  =  G(fi(i  +  c)), 

where  c  is  a  fixed  element  (say  the  smallest  one)  in  the  set  C  characterized  in  Proposition 
A5. 

Assumption  4  The  distribution  function  G  has  a  continuous  density  g(w)  with  support 

[0,w],  w  <  CO. 

Proposition  B2 

Under  Assumptions  1,  2  and  3  made  in  Appendix  A  we  have  that  the  adjustment  hazard 

satisfies: 

(a)      lim|^|^oo  A(2;)  =   1.  Furthermore,  there  exists  a  positive  constant  M  such  that  for 

|a;|  >  M  we  have  A(a;)  =  1. 


It  is  arguable  which  criterion  is  more  adequate  in  our  case.    On  one  hand,  we  allow  for  an  additive 
constant  term  when  estimating  our  models,  on  the  other  hand,  it  did  not  vary  across  sectors. 
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(b)     A(a;)  is  difFerentiable  at  all  x  and: 

A'{x)  =  g(a{x  +  c)}n'{x  +  c). 

Proof 

(a)  This  follows  immediately  from  Proposition  A4  and  Assumption  1. 

(b)  It  follows  from  the  fact  that  A(x)  is  the  composition  of  two  differentiable  functions  and 
therefore  differentiable  (see  Proposition  A3).     | 

3      Invariant  distribution 

Due  to  the  presence  of  aggregate  shocks,  the  distribution  of  disequilibria  that  determines 
aggregate  investment  (see  Section  2.3.3  of  the  main  text)  has  no  invariant  distribution.  In 
Caballero  and  Engel  (1992b)  we  establish  that,  in  a  well  defined  sense,  the  average  over  all 
possible  trajectories  of  aggregate  shocks  of  the  cross-section  of  deviations  is  equal  to  the 
invariant  distribution  faced  by  an  individual  firm.''^  In  this  section  we  show  that  such  a 
distribution  exists  and  that  convergence  toward  it  takes  place  at  an  exponential  rate. 

The  following  operators,  all  of  which  are  defined  on  the  set  T  of  probability  densities 
on  the  real  line,  will  be  useful  throughout  this  section. 

•  We  let  7]  =  y4(/j  +  h)I{a'^)7i{e)  and  Tj  =  'H{B)A{^i  +  S)I{a'^),  where  fi  denotes  the 
mean  of  aggregate  shocks,  6  the  depreciation  rate,  a^  the  sum  of  the  variance  of 
aggregate  and  idiosyncratic  shocks,  and  9  the  set  of  parameters  characterizing  the 
function  A(x)  =  G{n{x  +  c)).  The  operators  A,  H  and  I  were  defined  earlier  in 
this  appendix.  For  any  integer  n  >  1  we  denote  by  7^"  the  n-fold  composition  of  7i, 
2  =  1,2. 

•  We  denote  by  V(/)  the  function  that  associates  to  /  the  fraction  of  firms  that  adjust 
when  applying  72. 

•  We  denote  by  Gn-\  the  operator  that  associates  to  an  initial  density  /  the  cross-section 
of  firms  that  have  not  adjusted  after  {n  -  1)  shocks,  normalized  to  one. 


mean  and  variance  a^  =  it\  +  a^ 


Thus  the  aggregate  shock  is  constant  and  equal  to  /i  and  the  idiosyncratic  shock  is  Normal  with  zero 

2 
A 
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We  cLSSume  throughout  that  shocks  are  Normal.  The  following  lemma  is  needed  to  establish 
the  main  result  of  this  section. 

Lemma  B2 

Given  a  cross-section  /o  let  /„  =  7^"(/o).  Denote  by  nn{fo)  the  probability  that  a  particular 
firm  adjusts  at  time  n,  conditional  on  not  having  adjusted  during  the  first  n  -  1  periods, 
and  by  'r„(/o)  the  fraction  of  firms  that  do  not  adjust  during  the  first  n  periods.  Then  there 
exists  a  constant  a  £  (0, 1),  common  to  all  initial  cross-sections,  such  that: 

(33)  ^n(/o)     >     1  -  Q, 

(34)  r„(/o)     <     a". 

Proof  Denote  by  J^n-i  the  set  of  all  densities  that  may  represent  those  firms  that  have 
not  adjusted  after  (n-  1)  shocks.  Since  J^„_i  C  T  and  7r„(/)  =  7ri(^„_](/)),  we  have  that: 

inf     7r„(/)     =        inf    V(g) 
>     inf  V(/) 
=     >nj^i(/)- 

Hence  a  lower  bound  for  tT]  also  is  a  lower  bound  for  7r„,  n  >  1. 

Ne.xt  note  that  it  follows  from  Proposition  B2  that  there  exists  a  constant  M  such 
that  A(x)  =  1  for  |i|  >  M .  Denote  by  A'(x)  the  adjustment  hazard  that  is  equal  to  one 
when  |i|  >  M  and  equal  to  zero  elsewhere.  It  is  ea^y  to  see  that  both  r„(/o)  and  1  — 
^i(/o)  corresponding  to  this  adjustment  hazard  are  larger  or  equal  than  the  corresponding 
quantities  for  the  original  hazard.  Tlius  it  suffices  to  prove  (33)  and  (34)  for  A'(a:)  and,  in 
the  case  of  (33),  for  n  =  1. 

Applying  the  operator  72  to  a  mass  point  at  x,  (5^,  we  have  that  the  value  of  x  for 
which  the  fraction  of  firms  that  does  not  adjust  is  largest,  is  the  value  such  that  the 
distribution  before  adjustment  is  normal  with  zero  mean  and  variance  cr^,'*°  and  this  fraction 
is  Q  =  2$(M/o-)  —  1,  with  0  <  q  <  1  and  ^  denoting  the  c.d.f.  of  a  standard  Normal.  It 
follows  that  for  any  density  /(x)  the  fraction  that  does  not  adjusts  after  applying  T2  is 
bounded  from  above  by  a.    Hence  tti  >  1  -  q.    Finally,  given  any  cross-section  /o(x),  we 


■"Since  <J>((A/  -  x)/cr)  -  $((-A/  -  x)/(t)  is  maximized  at  i  =  0. 


41 


have  that  ^^(/o)  =  n^_j(l  -  ■^kifo)),  which  is  bounded  from  above  by  q".     | 

Proposition  B3  Given  an  arbitrary  initial  cross-section,  /o,  let  fk  =  T^{fo).  Also,  define 
the  sequence  (/^)  as  above,  but  for  the  particular  case  where  /o  is  a  mass  point  at  x  =  0. 

Let  7r°  denote  the  probability  that  a  firm  which  starts  off  at  i  =  0  adjusts  at  time  n, 
conditional  on  not  having  adjusted  during  the  first  n  —  1  periods. 

Denote: 

n;:^(i-^,°) 


^'     i  +  E.>:[nL-d(i-'^?)]' 


and  define:  /'^(x)  =  J2i>oPifi{^)-   Then  /„  converges  to  /^  in  the  variation  distance  and 
convergence  takes  place  at  an  exponential  rate. 

Proof  We  consider  first  the  case  where  /o  =  Sq. 

At  any  moment  in  time,  f°  can  be  partitioned  into  groups  of  firms  which  last  adjusted 
the  same  number  of  periods  ago.  Hence  f°  is  a  convex  combination  of  /o,/i°,  •  •  • ,  fn-\- 

The  weights  on  the  above  densities  can  be  determined  by  the  one-to-one  correspondence 
with  the  Markov  process  with  state  space  S  -  {0,1,2,3,...},  and  transition  kernel: 

Pis,v)  =   (1   -  TTs)6s,v--[   +  ^s^s,0, 

where  (5,,^  =  1  if  z  =  j  and  zero  otherwise.   State  s  leads  to  s  +  1  (not  adjusting)  or  to  0 
(adjusting).  The  corresponding  probabilities  are  1  —  tt,  and  tt^. 

It  follows  from  Lemma  B2  that  the  above  process  satisfies  Condition  M  in  Lucas  and 
Stokey  (1989,  p.  348).  Indeed,  with  the  notation  of  these  authors  we  have  that  there  exists 
N  =  I  and  e  =  max(a,  1  -  q)  >  0,  with  q  defined  in  the  preceding  lemma,  such  that  for  all 
subsets  A  of  5: 

rmx{ P^  {s,  A),  P:{s,A^))  >  e, 


since 


m..iP,is,A),P,is,A^))  =\'  'f  ^°'^  +  ^^  ^  ^  °^  ^°'^  +  '^  ^  ^"^ 

1    max(7rs,  1  -  7r^)     otherwise, 

and  from  Lemma  B2  we  have  max(7rs,  1  -  tt^)  =  e  for  all  5. 

Hence  Theorem  11.12  in  Lucas  and  Stokey  (1989,  p.  350)  implies  that  there  exists  an 
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invariant  distribution,  /^,  and  a  constant  G  =  (1  -  e)  <  1,  such  that: 

(35)  II /° -n\<  G"ii fs -n\. 

Furthermore,  /^  is  the  unique  fixed  point  for  the  Markov  operator.     The  latter  and  a 
straightforward  calculation  show  that  f^  =  Y^^Pif"- 

Extending  this  results  to  the  general  case  where  /o  can  be  arbitrary  is  straightforward. 
All  firms  eventually  adjust  and,  once  they  adjust,  the  previous  case  applies  (since  they 
adjust  to  X  =  0). 

Given  an  arbitrary  /o  we  may  write: 

T"(/o)  =  (1  -  V2)r"/^(/„*)  +  V2(5n), 

where  /*  is  a  convex  combination  of  Jq,  f^,. . . ,  f",^;  gn  could  beany  cross-section,  and  r„ 
was  defined  in  Lemma  B2. 

From  Lemma  B2  and  the  fact  that  the  variation  distance  is  bounded  from  above  by  1 
we  have  that  there  exists  a  G  (0,  1)  such  that: 

iiTVo-rii  <  (i-r„/2)iiT'^/'(<5o)-rii+v2 
<  G"/'  II  /o  -  r  II  +a"/'. 

Convergence  follows  by  letting  n  tend  to  infinity;  the  rate  at  which  convergence  takes  place 
is  geometric,  bounded  from  above  by  mcLx(vG,  i/q).     | 

C:  Econometrics 

This  appendix  is  divided  into  two  sections.  In  Section  1  we  derive  the  likelihood  function 
and  sketch  the  general  approach  used  for  calculating  this  function  at  given  parameter  values. 
In  Section  2  we  describe  implementation  details  for  the  semistructural  (Section  2.1)  and 
structural  (Section  2.2)  cases. 
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1      Calculating  the  likelihood  function 

1.1      An  expression  for  the  likelihood 

The  sources  of  randomness  ("error  terms")  are  the  sectoral  (aggregate)  shocks,  i.e.,  the 
u,('s,  where  i  =  I,..., I  and  t  =  l,...,r.'"  We  assume  that  sectoral  shocks  are  Normal 
and  independent  over  time,  and  denote  the  mean  and  variance  of  these  shocks  in  sector  i 
by  ^,  and  c„,  respectively.  The  (column)  vector  of  sector  i's  aggregate  shocks  is  denoted 
by  V,;  V  denotes  the  column  vector  with  Vi  followed  by  V2  and  so  on;  and  fiy  =  ^[V]- 
We  allow  for  contemporaneous  correlation  among  shocks  from  different  sectors;  the  matrix 
C  =  [c,j]  denotes  the  corresponding  covariance  matrix  (which  does  not  vary  over  time). 

Standard  change  of  variable  calculations  lead  to  the  following  expression  for  minus  the 
log  likelihood: 

-Log.  lik.  =  const  +  ^  log  §^   +  ^  log  \C\  +  hv  -  /iv)'(C-'  ®  hW  -  ^iv), 


dvu 


where  A  Q  B   denotes  the  Kroneckcr  product  of  matrices  A  and   B  and   y,;    =    lal^'^ii- 
Concentrating  the  likelihood  with  respect  to  C  and  /iv  leads  to: 


[361 


Log.  lik.  =  const  +  y^log 


dyxi 


dv,t 


+  2l°g 


{V  -  fIv){V  -  ^TvY 


where  /I'v  corresponds  to  the  vector  of  sample  means.    That  the  Jacobian  is  well  defined 
follows  from  Proposition  B2. 

Calculating  the  likelihood  in  (36)  requires  calculating  the  sectoral  shocks  (the  v,t's)  and 
the  corresponding  partial  derivatives  (the  dyu/dv,i^s).  Next  we  show  that,  conditional  on 
the  initial  cross-section  and  the  set  of  parameter  values,  the  relation  between  sectoral  shocks 
and  sectoral  investment  rates  is  invertible.  Our  proof  is  constructive:  it  describes  how  the 
sectoral  shocks  are  actually  calculated  for  given  parameter  values. 


^'By  working  with  a  continuum  of  firms  we  have  that,  despite  the  presence  of  idiosyncratic  and  adjust- 
ment shocks  at  the  micro  level,  the  only  source  of  sectoral  randomness  are  aggregate  shocks.  That  is, 
the  cross-section  that  results  after  adjustments  is  uniquely  determined  by  the  adjustment  function  and  the 
cross-section  prior  to  adjustments.  Also,  the  cross-section  that  results  after  the  idiosyncratic  shocks  is  the 
convolution  of  the  density  (common  across  plants  and  sectors)  from  which  these  shocks  are  drawn  with  the 
cross-section  prior  to  the  shock.  See  equations  (15)  and  (16)  in  the  main  text  for  the  corresponding  formulas. 
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1.2      Calculating  the  components  of  the  likelihood 

Suppose  we  know  the  cross-sections  of  disequilibria  in  every  sector  at  time  t  =  0.  It  follows 
from  the  aggregate  dynamics  in  our  model  that  the  observed  capital-investment  ratio  in 
the  i-th  sector  during  period  t  is  determined  by  the  aggregate  shocks  in  the  first  t  periods 
(see  equations  (15),  (16)  and  (18)): 

(37)  y,t  =  yltiv^l,v,2,...,v,t,M■,0)),         i  =  l,...,/. 

Furthermore,  aggregate  investment  is  a  function  of  only  the  current  aggregate  shock 
and  the  cross-section  prior  to  this  shock: 

(38)  yu     =     y.tiM-J-l),Va) 

(39)  =      I  4>{v,t  +  S  -  x)A{x -vu-S)Mx,t-  l)dx, 

where  in  our  case  (p{u)  =  e"  -  1  but,  more  generally,  in  the  derivation  that  follows  4>{u) 
could  be  any  smooth  and  strictly  increasing  function  with  4'(0)  =  0.  The  derivative  of  the 
above  expression  with  respect  to  v,t  evaluated  at  v  is  equal  to: 


d 


^(/,(-,^-1),lO  =   /  [(h'(v  +  8  -  x)A(x  -  V  -6)  +  4>{v  +  6  -  x)A'{x  -v-  8)]  f,{x,t-l)dx. 

Vu  J 


Recalling  that  f{x,t)  denotes  the  cross  section  immediately  after  period  Vs  sectoral  (and 
depreciation)  shocks,  we  have  that: 

(40)  ^iM;t-l),Va)=   [  [<P'{-x)\(x)  -  <t>{-x)A'{x)]  Mx,t  -  l)dx. 

It  follows  from  our  assumptions  on  d  and  the  fact  that  A(z)  is  decreasing  for  negative  x 
and  increasing  for  positive  x  (the  "increasing  hazard"  property)  that  the  above  derivative 
is  strictly  positive  when  /,(i,  ^  -  1)  has  support  equal  to  the  real  line  (as  in  our  case).  Thus 
Vii  is  uniquely  determined  from  (38): 

(41)  vu     =     f.((2/.r,/,(-,^-  1)) 

(42)  =     Vaiy^t,v,,t-\,■■■,Vi^■l,f,{■,0)), 
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and  proceeding  inductively  we  conclude  that: 

v,t  -  v,t{y,t,y,,t-i,--  •  ,2/.i,/.(-.0))). 

It  follows  that,  conditional  on  the  initial  cross-sections,  the  t;,('s  are  uniquely  determined 
by  the  y,('s. 

1.3  Initial  cross-sections 

The  initial  cross-section  in  sector  i  is  set  equal  to  the  invariant  probability  measure  of  the 
unconditional  process  describing  the  evolution  of  disequilibria  for  an  individual  plant  in 
that  sector.''^  This  is  the  cross-section  obtained  when  averaging  over  all  possible  sample 
paths  of  aggregate  shocks. ''■' 

Although  this  selection  is  arbitrary,  we  checked  the  robustness  of  our  results  by  study- 
ing the  convergence  properties  of  cross  sections  distribution  near  our  initial  distribution. 
We  compared  the  sequence  of  cross-sections  used  in  our  likelihood  calculations  with  those 
obtained  when  we  perturbed  the  mean  of  the  invariant-initial  distribution  by  one  standard 
deviation  of  the  average  (across  sectors)  aggregate  shocks.  The  Markov  structure  of  our 
problem,  combined  with  the  contractionary  features  derived  in  Appendix  A,  ensures  that 
for  any  given  sequence  of  aggregate  shocks,  the  distance  between  both  cross-sections  tends 
to  zero  over  time  with  probability  one;  the  issue  is  how  fast  one  distribution  converges  to 
tlie  other.  Simulations  showed  that,  for  the  parameter  values  considered,  the  distance  be- 
tween both  sequence  of  cross  sections  becomes  negligible  (variation  distance  less  than  0.01) 
sometime  between  the  second  and  third  cross-section  after  the  initial  one.''''  For  this  reason, 
we  discarded  the  first  three  observations  for  all  series  when  calculating  the  likelihood. 

1.4  Summary 

Given  a  set  of  parameter  values,  we  calculate  the  likelihood  in  (36)  as  follows: 


By  unconditional,  we  mean  that  we  do  not  condition  on  actual  sectoral  shocks.  For  this  reason  the 
variance  of  shocks  relevant  for  this  distribution  is  the  sum  of  the  variances  of  sectoral  and  idiosyncratic 
shocks.  In  Appendix  B.l  we  show  that  if  Fo  denotes  the  probability  measure  describing  a  particular  plant's 
deviation  at  time  t  =  0,  and  Ft  the  corresponding  probabihty  measure  t  periods  later,  then  Ft  converges  in 
the  variation  distance  to  a  distribution  F*  which  does  note  depend  on  the  initial  distribution  Fo. 
''^See  Caballero  and  Engel  (1992b)  for  a  proof. 
The  parameter  on  which  convergence  depends  most  is  the  variance  of  idiosyncratic  shocks;  convergence 
is  faster  as  this  parameter  becomes  larger. 
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1.  The  initial  cross-section  of  firms'  disequilibria  (one  for  each  of  the  21  two-digit  man- 
ufacturing sectors  considered)  are  set  equal  to  the  invariant  distribution  faced  by  an 
individual  plant.  These  cross-sections  are  denoted  /,(x,0);  i  =  1,...,21. 

2.  For  i  =  1  to  T: 

(a)  Solve  (38)  to  find  u,(,  z  =  1, . . . ,  21. 

(b)  Calculate  dyit/dv^t  from  (40). 

(c)  Determine  the  next  set  of  cross-sections  of  disequilibria  (the  /,(-,<)'s)  based 
upon  (15)  and  (16). 

The  next  section  provides  the  details  on  exactly  how  everyone  of  the  steps  above  are 
conducted  in  both  estimation  approaches  (semistructural  and  structural). 

2      Implementation 

2.1      Semistructural  approach 

This  approach  estimates  the  adjustment  rate  function  directly.  We  assume  that  the  adjust- 
ment rate  function  is  common  across  sectors  and  of  the  form: 

(43)  A(x)  =  1  -e-^°--^^^', 

with  Aq  >  0  and  A2  >  0.  W'e  estimate  three  parameters  (besides  the  means  and  var-cov. 
matrix  of  aggregate  shocks);  Aq,  A2  and  an  additive  constant  (common  across  sectors). 

2.1.1      Estimating  the  initial  cross-sections 

In  what  follows,  we  do  not  make  any  assumptions  about  the  mean  of  the  aggregate  shock.  If 
we  knew  this  mean,  or  could  estimate  it  directly  from  the  observed  data,  then  we  could  de- 
termine the  invariant  density  by  calculating  the  invariant  probability  function  of  a  standard- 
fixed  Markov  chain  (see  Caballero  and  Engel  (1994)  for  details). 

To  compute  the  initial  cross-section  we  proceed  as  follows.   For  sector  i  we  let  5,(1,0) 
denote  a  Normal  density  with  zero  mean  and  variance  CTq  =  cr^  +  c„.   We  set  c,,  equal  to 


47 


0.035  for  all  sectors.''^  Given  5,(1, r  -  1)  we  calculate  g,{x,T)  by  first  solving  for  v  in 

y  [niXo,  A2M(tV  +  S)]g,{x,  r  -  1)  =  y„ 

where  y;  denotes  the  average  capital-investment  ratio  of  sector  i.  The  solution  is  denoted 
by  v^r-  Then  we  set: 

5,(1,  r)  =  T{v,r  +  6,Xo,X2,cTo)g,{x,T  -  1). 

As  T  grows,  5,(a:,r)  approaches  the  unconditional  invariant  density  for  an  individual 
plant,  and  v,t  approaches  a  constant  consistent  with  the  mean  of  sectoral  investment/capital 
ratio.  We  use  this  density  as  the  initial  cross-section  when  calculating  the  likelihood. 
Simulations  showed  that  using  30  iterations  (for  each  sector)  was  sufficient  for  all  practical 
purposes. 

2.1.2      Calculating  the  likelihood 

The  family  of  adjustment  functions  we  work  with  has  the  attractive  property  that  the 
evolution  of  the  cross-sections  can  be  tracked  efficiently  using  a  convex  combination  of  a 
small  number  of  Normal  densities,  thus  reducing  computational  time  substantially.  To  see 
this,  we  show  next  that  if  we  assume  that  f,{x,t  —  1)  is  a  convex  combination  of  A'  Normal 
densities,  then  f,{x,t)  is  a  convex  combination  of  A  -|-  1  densities.  We  also  derive  simple 
expressions  to  update  the  means,  %'ariances  and  weights  assigned  to  the  Normal  densities 
characterizing  /,(x,i). 

Consider  first  the  case  A'   =   1  and  assume  f,{x,t  -  1)  is  Normal,  with  mean  fi  and 
variance  a^.    A  simple  but  tedious  calculation  shows  that  solving  (38)  reduces  to  solving 
for  i',(  in: 
(44)  y_^  =  e'='''-+*'  -  1  -  Ie"''*"'-+*'  +  ^''''"+*', 


where 


t'     = 


a 


a^ 


I  +  2A2CT2' 


There  are  two  reasons  for  fixing  c„:  First,  it  avoids  estimating  an  additional  non-linear  parameter. 
Second,  since  we  may  expect  that  the  variance  of  idiosyncratic  shocks  is  significantly  larger  than  the  variance 
of  aggregate  shocks,  the  value  of  the  latter  is  of  little  relevance  when  determining  the  adjustment  function 
and  the  invariant  density. 
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1     2 

c{v)     =     V  -  fi+  -a  , 
d{v)     =     Ao  +  — A2(v-/i) 


The  partial  derivative  in  (40)  is  equal  to: 

(45)  1^  =  e'^(^-+^)  +  4e-''<^"+*'  -  2X2{v,t  +  <5  -  /i)4 

dv,t  cr  a'- 


,cK,  +  5)  _  ^. 


It  follows  from  equation  (17)  in  the  main  text  that  the  cross-section  density  after  the 
<-th  period's  sectoral  (aggregate),  hazard  and  idiosyncratic  shocks,  f,{-,t),  is  a  convex 
combination  of  two  Normal  densities,  one  of  them  with  mean  tj  =  (fi  —  vn  —  ^)/(l  +  2A2cr^) 
and  variance  r^  +  a^^,  and  the  other  with  zero  mean  and  variance  cr^.  The  former  corresponds 
to  those  firms  that  did  not  adjust,  the  latter  to  those  that  adjusted  their  capital  stock.  The 
fraction  of  firms  in  the  group  that  does  not  adjust  is: 


(46) 


r2 

(7^ 


In  the  more  general  case,  where  f,{x,t-l)  =  '}2k=i  (^hf^(,x,i~^)  's  a  convex  combination 
of  vV  Normal  densities,  u,(  is  obtained  by  solving  an  equation  analogous  to  (44)  with  a  linear 
combination  of  terms  like  the  one  on  the  right  hand  side  of  that  equation: 

y,i  =  H  Qfc  /  (i'{vii  +  <5  -  x)A{x  -  u,(  -  S)f^{x,  t  -  l)dx. 
k         •' 

The  partial  derivative  is  equal  to  a  convex  combination  of  terms  like  those  in  (45).  We 
also  have  that  f,{x,t)  will  be  a  convex  combination  of  A''  +  1  Normal  densities.  Each  of 
these  cross-sections  corresponds  to  a  specific  cohort,  grouping  plants  that  have  not  adjusted 
for  the  same  number  of  periods.  The  "older"  cross-sections  are  more  spread  out  than  the 
"younger"  ones  and  have  lost  mass  monotonically  due  to  the  adjustment  of  their  members. 
Simulations  showed  that  keeping  track  of  30  densities  is  extremely  conservative:  the  impact 
on  aggregate  investment  of  cohorts  much  older  than  30  years  is  negligible.  For  this  reason, 
in  every  period  we  merge  the  two  oldest  cohorts  into  one  Normal  density  with  mean  and 
variance  equal  to  those  of  the  convex  combination  of  the  densities  being  merged. 
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2.2      Structural  approach 

Instead  of  estimating  the  adjustment  function  directly,  as  in  the  semistructural  case,  here  we 
estimate  the  parameters  of  the  distribution  of  adjustment  costs  and  obtain  the  adjustment 
function  from  the  solution  of  the  dynamic  optimization  problem  described  in  section  2. 

The  initial  distribution  is  calculated  in  a  way  analogous  to  the  semistructural  case. 
Adjustment  costs  are  drawn  from  a  Gamma  distribution: 


'""'= 54)  r '"''"'**• 


which  has  mean  ^i^  —  p4)  and  coefficient  of  variation  cv^j  =  l/y/p-  Again,  we  estimate  three 
parameters  (besides  the  means  and  var-cov.  of  aggregate  shocks);  /x^,  cv^  and  an  additive 
constant  (common  across  sectors). 

2.2.1      Adjustment  function 

The  adjustment  function  for  a  given  set  of  parameters  is  obtained  by  solving  numerically 
the  stochastic  dynamic  optimization  problem  described  in  section  2.  For  this  purpose  - 
but  not  when  evaluating  the  likelihood  from  the  shocks  and  Jacobian  terms-  we  disregard 
sectoral  differences  in  /x^  and  cr^,  and  assume  the  parameters  that  determine  the  adjustment 
function  {fi^,  cv^,  a^,  r,  S,  P,  in  addition  to  fi^  and  cr^  )  are  common  across  sectors.  This 
allows  us  to  calculate  only  one  adjustment  function  and  use  it  for  all  sectors. 

We  use  a  grid  of  800  equally  spaced  points  on  the  interval  [-3.5,3.5]  to  determine 
the  value  function  via  value  iteration. '^^  The  corresponding  steps,  for  which  extensive 
simulations  showed  that  30  iterations  were  sufficient,  are:"*^ 


-•     Vr._,{z  +  Az)  +  ^e^(''+^^)   /  G{u)du 


Cn     =     argmax(u„(2)). 

Using  200  points  makes  no  significant  difference;  we  used  800  because  the  additional  time  involved  was 
small.  The  reason  why  we  need  at  least  200  points  is  that  we  fix  the  grid  of  possible  values  of  x  (between 
-3.5  and  3.5)  in  advance,  so  that  often  a  significant  part  of  this  interval  becomes  irrelevant  (the  hazard  is 
almost  equal  to  one  on  it).  Also,  the  finer  the  grid,  the  closer  we  can  get  to  the  case  where  the  adjustment 
hazard  looks  like  that  of  an  (5,  s)  policy. 

See  Section  2.2.3  for  the  derivations. 
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The  distribution  of  Az  is  Normal  with  mean  ln(l  —  8)  and  variance  equal  to  the  total 
variance  faced  by  an  individual  firm  (ctq  =  a'^  +  cr^).  When  calculating  Cn  we  interpolate 
with  a  quadratic  polynomial  the  value  function  Vn{z)  at  the  3  points  on  the  grid  where 
the  function  is  largest,  and  set  c„  equal  to  the  argument  of  the  maximum  value  of  this 
polynomial.  By  doing  this  maximization  over  a  smoothed  function,  we  avoid  having  to 
work  with  a  discontinuous  likelihood  function. 

We  set  the  mean  of  the  aggregate  shocks  equal  to  the  mean  estimated  with  the  semistruc- 
tural  approach.''^ 

2.2.2      Family  of  adjustment  functions 

The  adjustment  function  estimated  via  dynamic  optimization  is  evaluated  on  a  grid  of 
800  points.  This  makes  it  computationally  infeasible  to  solve  the  945  non-linear  equations 
needed  to  calculate  the  i;,('s  in  every  evaluation  of  the  likelihood.  For  this  reason  we  work 
with  a  family  of  adjustment  functions  characterized  by  only  a  few  parameters  and  such  that 
the  derivatives  needed  for  the  Jacobian  terms  do  not  need  to  be  calculated  numerically. 

Experimentation  with  a  variety  of  distributions  of  adjustment  costs  showed  that  the 
family  of  continuous,  piecewise  inverted  Normal  adjustment  functions  approximates  well  the 
adjustment  functions  obtained  via  value  iteration.  Three  pieces  suffice  for  most  practical 
purposes,  with  the  middle  piece  equal  to  zero.  A  representative  member  of  this  four- 
parameter  family  is  of  the  form: 

1  _  e-\-(x-r-)2     if^<^-^ 

(47)  A(x)  =  <(    0  U  X-  <  X  <  x  +  , 

l_e->^(--^)^     ifx>x+. 

We  approximated  the  positive  (x  >  0)  and  negative  (i  <  0)  arms  of  the  adjustment 
function  obtained  via  value  iteration  separately.  We  determined  x'^  and  A"*"  by  imposing 
that  the  approximation  matches  the  function  obtained  via  value  iteration  at  the  (posi- 
tive) points  where  the  hazard  equals  0.25  and  0.75,  We  obtained  x~  and  A~  imposing  an 
analogous  condition  for  negative  values  of  x. 


*  We  set  this  value  ex-ante  to  avoid  having  to  estimate  additional  non-linear  parameters.  Because  of  the 
non-linear  adjustment  term  4i[v„  +  S  —  x)  in  equation  (38),  there  is  no  simple  way  to  obtain  an  estimate 
of  this  drift  from  the  data.  Also  note  that,  as  described  earlier,  when  calculating  the  invariant  density  of 
firm  deviations,  we  allow  for  a  firm  specific  mean  that  is  approximately  equal  to  the  observed  mean  of  the 
corresponding  sector. 
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2.2.3      Calculating  the  t;j('s  and  the  corresponding  derivatives 

When  keeping  track  of  the  cross-section  of  deviations,  we  approximate  /,(■,<  -  1)  by  33 
mass  points  on  a  grid  of  equally  spaced  points  (we  discuss  why  we  chose  33  points  shortly). 
We  solve  for  v  in: 

y[ni\-,X+,x-,x+)A{v  +  S)]Mx,t-l)  =  yu. 

The  partial  derivatives  are  calculated  from  (40).  Next  fi{x,t)  is  obtained  from: 

f,{xj)  =  Tiv.t  +  6,X- ,X+  ,x- ,x+  ,a,)f,{x,t  -  I). 

The  operator  A{v^t  +  ^)  is  implemented  by  shifting  the  33  mass  points  describing  fi{-,t  — 
1)  by  u,(  +  8.  The  adjustment  operator,  H,  is  applied  next,  leading  to  34  mass  points  (one 
at  each  point  where  there  was  mass  before  adjustments  and  a  new  mass  at  zero):  if  x  is  a 
point  with  mass  m(x)  on  the  pre-shock  grid,  then  after  the  adjustment  shock  we  have  mass 
(1  -  .\{x))m{x)  at  X  and  mass  A(x)m(x)  stemming  from  this  point  at  zero. 

Finally  the  idiosyncratic  shock  takes  place:  each  one  of  the  34  mass  points  becomes 
a  Normal  density  with  mean  equal  to  the  point  where  the  mass  was  located  and  stan- 
dard deviation  CT;.  The  resulting  density  is  computed  at  33  equally  space  points  on 
[//,(  -  4ct,,  ,  /i,,  4-  4cr,(],  where  /i,(  and  a1^  denote  the  mean  and  variance  of  the  cross-section 
obtained  after  the  idiosyncratic  shock. ''^  We  work  with  a  dynamic  grid  to  reduce  the  num- 
ber of  points  needed  to  track  the  cross-section.  Simulations  showed  that  33  points  on  a 
grid  of  width  equal  to  8  standard  deviations,  centered  around  the  mean,  suffices  to  obtain 
accurate  estimates. 


Both  the  mean  and  variance  are  calculated  analytically,  thereby  avoiding  any  circularity. 
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